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CCCAGGGAAG TGCAGATCCA CGTGCATATT ttaAATTATT TTGGTAACGG 1140 

^™ SSSS S5SE " Stgcagaag 1200 

AGTGC ATAGG T.i.AAACAC CGG^ACTGCC TTGCAGTC-AG CATTGCAGAT 1260 

tgatgcctgg atgcctgttg cagctgttta cc^actgcc cagccacc tcccggaaca 1320 

AGGGGTGGGG TG-TTTGTGa CGTGTTCCCA =£± agtaGATGAG TTACTATGAA 1360 

CATCTCACCT GCTGGGTACT TTTCAAACCA TC-AG.AGT ^ GTTGGGCAAA 1440 

acagagaagt tcctcagttg gatattctca t-^gtct tttt tcctttctat x50O 

CTATGATAAA GCATCTCTAX "^AAATTA T^.JCTTGTT ^ CTTCAATCTT 1S60 

AGCACCACTT A7TGCAGCAG GTOjMGCTC ^rGTGGOC GTAAACAGTA 1620 

TTAAAGCTTC T77GCAAA7A CACTCACTTG ^"J*^" AAAGAATTCC GCCTATTCAT 1680 
CTTACCTTTG ATCCCAATGA ^ATCGAGCAT -^GTTGTA J"* CAAGTAATAG 1740 

ACCATGTAAT ^TTTTAC JCCCCCAGTC £;££7TT AAACTGTGCA 1BO0 

ACTTTGGCCT CAGCCTCTTG TGTAOGTAT £^AATA^ CAAATGT , A3 AAAATGAGGA 1860 
TATGATTAT7 ACA7TATGAA A GAGACATTC J^atXcT GCAGGTGTCC TTAAAAAAAA 1920 
GTGCGTGTGC T7TTATAAAT ^AAGTGA^ C-^WJJJ ?SaT**T TCCTATTTGG 1980 
AAAAAAAAAG 7AATATAAAA AGGACCAGGT ctTTCTAAAC ATAAGGCTCT 2040 

TAAACAGTTA CATTTTTATG AAGATTACCA ^GCTGCTGA CTTT GTCTGGGTAA 2100 

ATTGTCTTCC TG7ACCATTG CATTTCCTCA ^^CAATTT ^ GA-VTGCAGAG 2160 

ACTATTCAAG AAA7GGCTTT GAAATACAGC A.GGGAGCTT GTCTGAJX 222 0 
TTGCACTGCA AAATGTCAGG AAATGGA7GT CiCTCMgM «CCUCT^ GCTQAATGTT 226 0 
ATATGTGTAT A7AGTAAGCA GTTTCCTGAT ^CAGCAGGC ^ ACTGCGGATT 2340 

GTGTTGCCGG AGACCTGTAT TTCTCAACAA ^TAAGATGG GTTTCATCAT 2400 

TTAATACATT 77CAGCAGAA GTACTTAGTT J-CTCTACC £™ CCTTTT7TTC 2460 

TTTTAGATGT TATACTTGAA ATACTGCATA ™T££ ££r TGGTC 77AAAC7GCA 2520 

SSSSS SS SEK S sgsjs SSSS S! 
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(57) Abstract: The present invention relates to novel 
methods of producing transgenic avians, preferably 
chickens, wherein the incorporated transgene may 
he expressed as a constituent protein of the white 
of a hard-shell egg. The present invention provides 
sperm-mediated transfer for the introduction to an avian 
egg of a transgene encoding a heterologous polypeptide. 
The avian sperm may be irradiated before the transgenic 
gene is incorporated therein. Transgenic genes may 
be incorporated into avian sperm by lipofection, 
electroporation, restriction enzyme mediated integration 
(REMI) or similar methods. The modified avian 
sperm may then be delivered to an avian oocyte by 
microinjection, intracytoplasmic sperm injection (ICSI) 
or artificial insemination, or by natural coitus after 
the modified avian sperm are returned to a male bird. 
Heterologous nucleic acid may be integrated directly 
into the genomic nucleic acid of the oocyte or after first 
integrating the heterologous nucleic acid into the nucleic 
acid of a male germ cell and subsequent delivery of the 
transgenic male germ cell to an oocyte. Alternatively, 
the heterologous nucleic acid may be a episome within 
the sperm, or within the derivative zygote formed by 
the fusion of the sperm and the recipient oocyte, and 
may replicate independently of the zygote genome. 
Co-segregation of the episome with the replicated ooctye 
genome into all of the daughter cells may be induced by 
the heterologous nucleic acid having a centromeric body 
_deriyed from, for example, a chromosome o f a chicken. 
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1 pfl n m THE TNVKNTION 

,0 The present invention relates to methods of producing atransgenic avian by 

^ducingannoleioaoidencodingahetorologoospeotoinintofte genome of an avan 
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depoLd into the white of an avian egg, said avian generated by sperm-medtatod 
died ftomme transgenic avians and toe heterologous protons tsolated drerefrom. 

20 

2. parKKROWTO 

The field of transgenics waa initially developed to understand the acuon of a single 
— infte^Lof the whole anima! and dre phenomena of gene acuvation, express™, 
g 7T?In ^technology has alao been usedtopnaduce models for various diseases 
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(KoMer andMilstein, 1975,^*256:495-497). Although various strategies have been 
proposed to overcome the deficiencies in antibody yield (.e.g. , engineering single-chain 
antibodies(scAb)compri^^ 

method has been proven entirely satisfactory in elevating antibody yields to the levels 
5 desired for adequate commercial production. 

The industry has been experimenting with transgenic animals that can express, for 

the protein in an active form while incorporating postradiational modifications, such as 

glycosylatior^typi^^ . . 

10 heterologous nucleic acids have been engineered so that an expressed protein may be j oined 
to a protein or peptide that will allow secretion of the transgenic expression product into 
milk or urme, from whichme^ 

limited success, however, and may require lactating animals, withthe attendant costs of 
maintaining individual animals or herds of large species, including cows, sheep or goats. 

15 

Avian Transgenics 

One transgenic system that holds potential is the avian reproductive system. The 
exogenous protein can be produced in the white of an avian egg from which it may be 
readily purified. (MacArthur, PCT Publication WO 97/47739). The production of an avian 

20 eggb e^wimformationofala, g eyolkmtheovaryoftheher, The unfertilized oocyte or 
ovumispositionedontopoftheyolksac. After ovulation, the ovum passes into the 
infundibulum of the oviduct where it is fertilized, if sperm are present, and then moves into 
the magnum of the oviduct, lined with tubular gland cells. These cells secrete the egg-white 
proteins, including ovalbiimin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and 

25 ovomucin, into the lumen of the magnum where they are deposited onto the avian embryo 

and yolk. , 

The hen oviduct, for example, can serve as an excellent protein bioreactor because 
of the high levels of protein production, the promise of proper folding and post-translation 
modification of the target protein, the ease of product recovery, and the shorter 
30 developmentdperiodofcWckenscomparedtoomerpotentialarumalspecies. The 

economic advantage of breeding flocks of transgenic birds laying eggs expressing 
exogenous proteins would be significant when compared to more traditional animals, such 
as cows, sheep or goats, producing heterologous protem m milk. What is needed, however, 
is an efficient method of introducing a heterologous nucleic acid into arecipient avian 
35 embryonic cell. 
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Vectors 

Genetic information has been transferred to avian embryos using vectors. 
Bosselman et al. in U.S. Patent No. 5,162,215 describes a method for introducing a 
rephcation-defective retroviral vector into a pluripotent stem cell of an unincubated chick 
5 embryo, and further describes chimeric chickens whose cells express a heterologous vector 
nucleic acid sequence. However, the percentage of Gl transgenic offspring (progeny from 
vector-positive male GO birds) was low and varied between 1% and approximately 8%. 
In addition, the use of viral vectors poses limitations, including limitations on transgene size 
and potential viral infection of the offspring, tons, posing significant regulatory issues for 
10 production of biologies. 

Similarly, Jaenisch reported that while retroviral vectors did transfer genetic 
information to embryos, the resulting animals were mosaics with gene insertions at various 
loci in the genomic nucleic acid. (1976, Proc. Natl Acad Sci. USA 73: 1260-1264). The 
transgenes were also differentially expressed in the different tissues of each animal. 
15 (Jaenisch, 1980, Cell 19: 181-188). 

Nuclear Transfer 

Nuclear transfer from cultured cell populations is another route to produce 
transgenics, wherein donor cells may be sexed, optionally genetically modified, and then 

20 selected in culture before their use. The resultant transgenic animal originates from a single 
transgenic nucleus and therefore, mosaics are avoided. Nuclear transfer from cultured 
somatic cells also provides a route for directed genetic manipulation of animal species, 
including the addition or "knock-in" of genes, and the removal or inactivation or "knock- 
out" of genes or their associated control sequences (Polejaeva et al, 2000, Theriogenology 

25 53:117-26). 

Two types of recipient cells are commonly used in nuclear transfer procedures: 
oocytes arrested at the metaphase of the second meiotic division (MIT) and which have a 
metaphase plate with the chromosomes arranged on the meiotic spindle, and pronuclear 
zygotes. In agricultural mammals, however, development does not always occur when 

30 pronuclear zygotes are used, and, therefore, MH-arrested oocytes are the preferred recipient 
cells. Enucleated two-cell stage blastomeres of mice have also been used as recipients. 

After enucleation and introduction of donor genetic material, the reconstructed 
embryo is cultured to the morula or blastocyte stage, and transferred to a recipient animal, 
either in vitro or in vivo, and developed to term. (Eyestone and Campbell, 1999, J. Reprod. 

35 Fertil Suppl. 54: 489-97). Double nuclear transfer has been reported in which an activated, 
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previously transferred nucleus is removed from the host unfertilized egg and transferred 
again into an enucleated fertilized embryo. Activation (initiation of development) is most 
often induced chemically. Cultured cells can also be frozen and stored indefinitely for 
future use. 

5 Although gene targeting techniques combined with nuclear transfer hold tremendous 

promise for nutritional and medical applications, current approaches suffer from several 
limitations, including long generation times between the founder animal and production 
transgenic herds, and extensive husbandry and veterinary costs. It is therefore desirable to 
use a system where cultured somatic cells for nuclear transfer are more efficiently 

10 employed. 

Sperm-Mediated Transfection Mechanism 

A promising method for producing transgenic animals is the stable transfection of 
male germ cells in vitro and their transfer to a recipient oocyte. PCT Publication WO 

15 87/05325 discloses a method of transferring organic and/or inorganic material into sperm or 
egg cells by using liposomes. BachiUer et al. used Lipofectin-based liposomes to transfer 
DNA into mice sperm, and provided evidence that the liposome transfected DNA was 
overwhelmingly contained within the sperm's nucleus. (1991, Mol. Reprod Develop. 30: 
194-200). However, no transgenic mice could be produced by this technique. 

20 Similarly, Nakanishi and Iritani used Lipofectin-based liposomes to associate 

heterologous DNA with chicken sperm, which were in turn used to artificially inserninate 
hens. (1993, Mol Reprod. Develop. 36:258-261). Although the heterologous DNA was 
detectable in many of the resultant fertilized eggs, there was no evidence of genomic 
integration of the heterologous DNA either in the DNA-liposome treated sperm or in the 

25 resultant chicks. 

Heterologous DNA may also be transferred into sperm cells by a process called 
electroporation that creates temporary, short-lived pores in the cell membrane of living cells 
by exposing them to a sequence of brief electrical pulses of high field strength. The pores 
allow cells to take up heterologous material such as DNA while only slightly compromising 

30 cell viability. Gagne et al. discloses the use of electroporation to introduce heterologous 
DNA into bovine sperm subsequently used to fertilize ova. (1991, Mol. Reprod. Develop. 
29: 6-15). However, there was no evidence of integration of the electroporated DNA either 
in the sperm nucleus or in the nucleus of the egg subsequent to fertilization by the sperm. 

Yet another method initially developed for integrating heterologous DNA into yeasts 

35 and slime molds, and later adapted to avian sperm, is restriction enzyme mediated 
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integration (REM), which utilizes a linear DNA derived from a plasmid DNA by cutting 
that plasmid with a restriction enzyme that generates single-stranded cohesive ends. 
(Shemesh et at, PCT International Publication WO 99/42569). The linear, cohesive-ended 
DNA together with the restriction enzyme used to produce the cohesive ends is then 

5 introduced into the target cells by electroporation or liposome transfection. The restriction 
enzyme is then thought to cut the genomic DNA at sites that enable the heterologous DNA 
to integrate via its matching cohesive ends. (Schiestl and Petes, 1991, Proc. Natl. Acad. 
Sci. USA 88: 7585-7589). Although Shemesh described transgenic birds that were resistant 
to Infectious Bursal Disease, there was no evidence of expression or deposition of a 

10 heterologous protein in the oviduct for deposition onto egg whites. 

What is needed, therefore, is an efficient method of generating a transgenic avian 
capable of expressing a heterologous protein coded by a transgene, particularly in the 
oviduct for deposition into egg whites. 



15 3. STIMMARY QE THE INVENTION 

The invention provides methods for the stable introduction by sperm-mediated 
transfection of heterologous coding sequences into the genome of an avian, preferably a 
chicken, and expressing those heterologous coding sequence to produce desired proteins 
and/or to alter the phenotype of the transgenic avian. Synthetic vectors and gene promoters 

20 useful in the methods are also provided by the present invention, as are transgenic avians 
that express a heterologous protein and avian eggs, preferably chicken eggs, containing a 
heterologous protein. In a preferred embodiment, the vectors useful in methods of the 
invention are not eukaryotic viral, more preferably not retroviral, vectors (although the 
vectors may contain transcriptional regulatory elements, such as promoters, from eukaryotic 

25 viruses). In other embodiments, however, the vectors are retroviral vectors. 

One aspect of the present invention is a method of producing a transgenic avian, 
preferably a chicken, by introducing in an avian oocyte at least one transgene encoding at 
least one heterologous polypeptide by sperm-mediated transfection. The method comprises 
first, isolating an avian sperm, second, incorporating a transgene into the avian sperm, and 

30 third, delivering the modified avian sperm to an avian oocyte. In one embodiment, the 
avian sperm is irradiated with gamma rays before the transgene is incorporated therein. 

In one embodiment, the transgene is injected directly into the testis of a male avian 
and incorporated in the avian sperm. The modified sperm is then delivered to the avian 
oocyte by mating the male avian with a wild type or transgenic female avian. 

35 
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In another embodiment, the transgene is incorporated in the avian sperm in vitro by 
lipofection, electroporation, restriction enzyme mediated integration (REM!) or similar 
methods. In a preferred embodiment, the modified avian sperm is then delivered to the 
avian oocyte by natural coitus after the modified avian sperm are returned to the testis of a 

5 male avian. In another preferred embodiment, the modified avian sperm is delivered to the 
avian oocyte by microinjection (e.g., intracytoplasmic sperm injection (ICIS) or standard 
artificial fertilization methods). The resulting transgenic embryo can then be transferred to 
the oviduct of a recipient hen for development and to be laid as a shelled egg (or, 
alternatively, cultured ex vivo). The shelled egg is incubated to hatch a transgenic avian that 

10 has incorporated, preferably integrated into its genome, the selected nucleic acid. In 
preferred embodiments, the avian sperm is first irradiated before incorporated with the 
transgene. 

In certain embodiments, a Iransgene comprising a heterologous nucleic acid may be 
integrated directly into the genomic nucleic acid of an avian sperm and subsequently 
15 delivered to an avian oocyte. When me heterologous nucleic acid is directly integrated into 
the genome of the avian sperm which then fertilizes an avian oocyte, the resulting 
transgenic embryo will include the transgenic heterologous nucleic acid in all of its cells. In 
preferred embodiments, the transgenic heterologous nucleic acid is incorporated into at least 
one embryonic cell, preferably the germinal disk of an early stage embryo, that then develop 
20 into a transgenic avian. 

Alternatively, the heterologous nucleic acid may be an episome within the modified 
avian sperm, or within the derivative zygote formed by the fusion of the modified avian 
sperm and the avian oocyte. The episome may replicate independently of the zygote 
genome. When the heterologous nucleic acid is episomal with respect to the genome of the 
25 transgenic zygote, and the episomal nucleic acid has a centromeric body, most, if not all, of 
the cells of the transgenic embryo will include the heterologous nucleic acid. Accordingly, 
in preferred embodiments, the transgene further comprises centromere and/or telomere 
sequences of an avian chromosome. 

The invention further provides method for incorporating at least one transgene into 
30 the genome of a spermatozoon cell or a precursor thereof isolated from a donor male avian, 
and returning the modified cell to the testis of a recipient male avian, preferably the donor 
male avian, so that a genetically modified male gamete is produced by the male avian. 
Breeding the male avian with a female of its species will generate a transgenic progeny 
carrying the at least one transgene in its genome. 

35 
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The invention also provides methods for introducing a heterologous nucleic acid to 
an avian oocyte in addition to those described in United States Application Serial No. 
09/877,374, filed June 8, 2001, entitled "Production of Monoclonal Antibody By a 
Transgenic Chicken", by Jeffrey C. Rapp; and United States Application Serial No. 

5 9 filed September 1 8, 2002, entitled "Production of a Transgenic Avian By 

Cytoplasmic Injection", by Jeffrey C. Rapp and Leandro Christmann, both of which are 
incorporated by reference herein in their entireties. In certain embodiments, the avian 
oocyte is removed from the ovaries of a donor female avian to facilitate in vitro fertilization 
by the modified avian sperm of the invention. In other certain embodiments, the modified 

10 avian sperm is delivered to an avian oocyte in vivo by natural coitus. The fertilized ova is 
then, preferably, returned to or maintained in the oviduct of the donor female avian or a 
surrogate female avian to be laid as a hard-shell egg or, as an alternative, cultured ex vivo. 
The hard-shell egg is incubated and hatched, producing a transgenic chick that expresses a 
heterologous protein and/or that can be bred to generate a line of transgenic avians 

1 5 expressing a heterologous protein. 

Preferably, the avian sperm or the reproductive system of a male avian, preferably 
the seminiferous tubules and/or site of sperm production, development, and/or storage in the 
testis, is irradiated by gamma rays before transgene incorporation. More preferably, the 
transgene is integrated directly into the genome of the avian sperm. Most preferably, the 

20 transgene further comprises centromere and/or telomere sequences. 

In particular embodiments, the level of mosaicism of the transgene (percentage of 
cells containing the transgene) in avians hatched from sperm-mediated transfected embryos 
(*.£, the GOs) is greater than 5%, 10%, 25%, 50%, 75% or 90%, or is the equivalent of one 
copy per one genome, two genomes, five genomes, seven genomes or eight genomes, as 

25 determined by any number of techniques known in the art and described zw/ra. In 
additional particular embodiments, the percentage of GOs that transmit the transgene to 
progeny (Gls) is greater than 5%, preferably, greater than 10%, 20%, 30%, 40%, and, most 
preferably, greater than 50%. 

In certain other embodiments, the level of transgenics that result from mating with a 

30 wild type or transgenic avian avians hatched from sperm-mediated transfected embryos (i e. , 
the GOs) is greater than 5%, 10%, 25%, 50%, 75% or 90%. 

In another embodiment, the present invention provides methods for producing 
heterologous proteins in avians. Transgenes are introduced by sperm-mediated transfection 
into the genome of an avian oocyte which becomes fertilized and then develops into a 

35 transgenic avian. The heterologous protein(s) of interest may be expressed in the tubular 
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gland cells of the magnum of the oviduct, secreted into the lumen, and most preferably, 
deposited within the egg white onto the egg yolk or expressed, for example, in the serum of 
the avian. In preferred embodiments, the level of expression of the heterologous protein in 
the egg white of eggs laid by GO and/or Gl chicks and/or their progeny is greater than 5 ng, 
5 10 ng, 50 fig, 100 jig, 250 jig, 500 ^g or 750 jig, more preferably greater than 1 mg, 2 mg, 5 
mg, 10 mg, 20 mg, 50 mg, 100 mg, 200 mg, 500 mg, 700 mg, 1 gram, 2 grams, 3 grams, 4 
grams or 5 grams. 

The transgenic avians can also be bred to identify those avians that carry the 
transgene in their germ line. The exogenous gene coding for the heterologous proteins can 

1 0 therefore be transmitted by sperm-mediated transfection of the exogenous gene into the 
avian oocytes, and by subsequent stable transmission of the exogenous gene to the avian's 
offspring in a Mendelian fashion. More information on Mendelian inheritance can be found 
in Hartl and Jones, 2001, Genetics: Analysis of Genes and Genomes, 5th ed., Jones & 
Bartlett Publishers, Inc., the content of which is incorporated by reference herein in its 

15 entirety. 

Another aspect of the invention provides for the isolation of heterologous proteins in 
transgenic avians and the use thereof in pharmaceutical products including but not limited 
to vaccines, biologies and, particularly, therapeutically or diagnostically useful antibodies. 
The expressed heterologous protein(s) of interest may be collected and processed using 

20 standard techniques from the avian eggs, preferably the egg white, the serum, or other 
tissues from the transgenic avian. 

The present invention further provides methods for producing a heterologous protein 
in an avian oviduct. The method comprises, as a first step, providing a vector containing a 
coding sequence and a promoter that functions in avians, preferably in the avian magnum, 

25 operably linked to the coding sequence, so that the promoter can effect expression of the 
nucleic acid in the tubular gland cells of the magnum of an avian oviduct and/or in any other 
desired tissue of the avian. In a preferred embodiment, the vector containing the transgene 
is not a eukaryotic viral vector (preferably, not a retroviral vector, such as but not limited to 
reticuloendotheliosis virus (REV), ALV or MMLV) or derived from a eukaryotic virus (but, 

30 in certain embodiments, may contain promoter and/or other gene expression regulatory 
sequences from a eukaryotic virus, such as, but not limited to, a cytomegalovirus promoter). 
Next, the vector is introduced into avian sperm in vitro by lipofection, electroporation, 
restriction enzyme mediated integration (REMI) or similar methods, or in vivo by directly 
injecting into the testis, so that the vector sequence may be incorporated into the avian 

35 sperm. In preferred embodiments, the avian sperm or precursor cells are irradiated by 
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gamma rays before the vector sequence is incorporated therein. In another preferred 
embodiment, the vector sequence further comprises centromere and/or telomere sequence. 
Then, the modified avian sperm are delivered to an avian oocyte by natural coitus or in vitro 
by microinjection or artificial insemination to form a transgenic embryonic cell. In certain 
5 embodiments, the recipient avian oocyte is wild type unmodified or preferably, modified m 
a manner that facilitates the delivery of transgene by the modified avian sperm. In certain 
other embodiments, the recipient avian oocyte is derived from a first-generation or 
preferably, second-generation transgenic avian whose germ-line carries the transgene. 
Finally, a mature transgenic avian that expresses the exogenous protein in its oviduct is 
10 derived from the transgenic embryonic cell or by breeding a transgenic avian derived from 
the transgenic embryonic cell. 

The present invention further provides promoters useful for expression of the 
heterologous protein in the egg. For example, the transgene may comprise regions of at 
least two promoters derived from an avian including, but not limited to, an oviduct-specific 
15 promoter such as ovalbumin, lysozyme, ovomucoid, ovottansferrin, conalbumin, and 
ovomucin promoter or any other promoter that directs expression of a gene in an avian, 
particularly in a specific tissue of interest, such as the magnum, and a protamine promoter, 
or a fragment thereof which is sufficient to drive the expression of a marker gene such as 
Green Fluorescent Protein (GFP). Alternatively, the promoter used in the expression vector 
20 may be derived from that of the lysozyme gene that is expressed in both the oviduct and 
macrophages. In particular embodiments, the gene regulatory sequences are flanked by 
matrix attachment regions (MARs), preferably, but not limited to those associated with the 
lysozyme gene in chickens or other avians. The nucleic acid encoding the polypeptide may 
be operably linked to a transcription promoter and/or a transcription terminator. 
25 Other embodiments of the invention provide for transgenic avians, such as chickens 

or quail, carrying a transgene in the genetic material of their germ-line tissue, preferably 
where the transgene was not introduced into the avian genome using a eukaryotic viral 
promoter. The transgene incorporated into the genomic DNA of a recipient avian can 
encode at least one polypeptide that may be, for example, but is not limited to, a cytokine, a 
30 growth factor, enzyme, structural protein, immunoglobulin, or any other polypeptide of 
interest that is capable of being expressed by an avian cell or tissue. Preferably, the 
heterologous protein is a mammalian, preferably a human, protein or derived from a 
mammalian, or preferably a human, protein («. g. , a derivative or variant thereof). In 
particular embodiments, the invention provides heterologous proteins isolated or purified 
35 from an avian tissue, preferably serum, more preferably eggs, most preferably egg whites, 
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and pharmaceutical compositions comprising such heterologous proteins. In a more 
preferred embodiment, the heterologous protein is an antibody that is human (including 
antibodies produced from human immunoglobulin sequences in mice or in antibody 
libraries or synthetically produced but having variable domain framework regions that are 
5 the same as or homologous to human framework regions) or humanized. 

The present invention further relates to nucleic acid vectors (preferably, not derived 
from eukaryotic viruses, except, in certain embodiments, for eukaryotic viral promoters and/ 
or enhancers) and transgenes inserted therein that incorporate multiple polypeptide- 
encoding regions, wherein a first polypeptide-encoding region is operatively linked to a 
10 transcription promoter and a second polypeptide-encoding region is operatively linked to an 
Internal Ribosome Entry Sequence (IRES). For example, the vector may contain coding 
sequences for two different heterologous proteins (e.g., the heavy and light chains of an 
immunoglobulin) or the coding sequences for all or a significant part of the genomic 
sequence for the gene from which the promoter driving expression of the transgene is 
15 derived, and the heterologous protein desired to be expressed {e.g., a construct containing 
the genomic coding sequences, including introns, of the avian tysozyme gene when the avian 
lysozyme promoter is used to drive expression of the transgene, an IRES, and the coding 
sequence for the heterologous protein desired to be expressed downstream (i.c, 3' on the 
RNA transcript of the IRES). Thus, in certain embodiments, the nucleic acid encoding the 
20 heterologous protein is introduced into the 5' untranslated or 3' untranslated regions of an 
endogenous gene, such as but not limited to, ovalbumin, lysozyme, ovomucoid, 
ovotransferrin, conalbumin, and ovomucin, with an IRES sequence directing translation of 
the heterologous sequence. 

Such nucleic acid constructs, when inserted into the genome of an avian and 
25 expressed therein, will generate individual polypeptides that may be post-translationally 
modified, for example, glycosylated or, in certain embodiments, form complexes, such as 
heterodimers with each other in the white of the avian egg. Alternatively, the expressed 
polypeptides may be isolated from an avian egg and combined in vitro, or expressed in a 
non-reproductive tissue such as serum. In other embodiments, for example, but not limited 
30 to, when expression of both heavy and light chains of an antibody is desired, two separate 
constructs, each containing a coding sequence for one of the heterologous proteins operably 
linked to a promoter (either the same or different promoters), are introduced into embryonic 
cells by sperm-mediated transfection to generate transgenic avians that harbor both 
transgenes in their genomes and expressing both heterologous proteins are identified. 
35 Alternatively, two transgenic avians each containing one of the two heterologous proteins 
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(e.g., one transgenic avian having a transgene encoding the light chain of an antibody and a 
second transgenic avian having a transgene encoding the heavy chain of the antibody) can 
be bred by Mendelian genetics to obtain an avian containing both transgenes in its germline 
and expressing both transgene encoded proteins, preferably in eggs. (See Hartl and Jones, 

5 2001, Genetics: Analysis of Genes and Genomes, 5th ed., Jones & Bartlett Publishers, Inc., 
the content of which is incorporated by reference herein in its entirety). 

For convenience, certain terms employed in the specification, examples, and 
appended claims are collected here. 

Additional objects and aspects of the present invention will become more apparent 

1 0 upon review of the detailed description set forth below when taken in conjunction with the 
accompanying figures, which are briefly described as follows. 

3.1 DEFINITIONS 

The term "animal" as used herein refers to all vertebrate animals, including birds. It 
15 also includes an individual animal in all stages of development, including embryonic and 
fetal stages. 

The term "avian" as used herein refers to any species, subspecies or race of organism 
of the taxonomic class aves, such as, but not limited to, chicken, quail, turkey, duck, goose, 
pheasants, parrots, finches, hawks, crows and ratites including ostrich, emu and cassowary. 
20 The term includes the various known strains of Gallus gallus, or chickens, (for example, 
White Leghorn, Brown Leghorn, Barred-Rock, Sussex, New Hampshire, Rhode Island, 
Ausstralorp, Minorca, Amrox, California Gray, Italian Partridge-colored), as well as strains 
of turkeys, pheasants, quails, duck, ostriches and other poultry commonly bred in 
commercial quantities. 

25 The term "male germ cells" as used herein refers to sperm, sperm cells, spermatozoa 

(i.e., male gametes) and developmental precursors thereof Male germ cells with the 
capacity to swim and transfer nucleic acid to an ovum are herein referred to as "viable male 
germ cells." In fetal development, primordial germ cells are thought to arise from the 
embryonic ectoderm, and are first seen in the epithelium of the endodermal yolk sac at the 

30 E8 stage. From there they migrate through the hindgut endoderm to the genital ridges. In 
the sexually mature male vertebrate animal, there are several types of cells that are 
precursors of spermatozoa, and which can be genetically modified, including the primitive 
spermatogonial stem cells, known as AO/As, which differentiate into type B spermatogonia. 
The latter further differentiate to form primary spermatocytes, and enter a prolonged meiotic 

35 prophase during which homologous chromosomes pair and recombine. Useful precursor 
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cells at several morphological/developmental stages are also distinguishable: preleptotene 
spermatocytes, leptotene spermatocytes, zygotene spermatocytes, pachytene spermatocytes, 
secondary, spermatocytes, and the haploid spermatids. The latter undergo further 
morphological changes during spermatogenesis, including the reshaping of their nucleus, 

5 the formation of aerosome, and assembly of the tail. The final changes in the spermatozoon 
(i.e., male gamete) take place in the genital tract of the female, prior to fertilization. 

The terms "ovum" and "oocyte" are used interchangeably herein. Although only 
one ovum matures at a time, an animal is born with a finite number of ova. In avian 
species, such as a chicken, ovulation, which is the shedding of an egg from the ovarian 

10 follicle, occurs when the brain's pituitary gland releases a luteinizing hormone, LH. Mature 
follicles form a stalk or pedicel of connective tissue and smooth muscle. Immediately after 
ovulation the follicle becomes a thin-walled sac, the post-ovulatory follicle. The mature 
ovum erupts from its sac and starts its journey through the oviduct. Eventually, the ovum 
enters the infundibulum where fertilization occurs. Fertilization must take place within 1 5 

1 5 minutes of ovulation, before the ovum becomes covered by albumen. During fertilization, 
sperm (avians have polyspermic fertilization) penetrate the blastodisc. When the sperm 
lodges within this germinal disk, an embryo begins to form as a "blastoderm" or "zygote." 

The term "embryonic cells" as used herein refers to cells that are typically single cell 
embryos, fertilized or unfertilized, or the equivalent thereof, and is meant to encompass 

20 dividing embryos, such as two-cell, four-cell, or even later stages as described by Eyal- 
Giladi and Kochav (1976, Dev. Biol 49: 321-337) and ova 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 
14, 16, 18, or 20 hours after the preceding lay. The embryonic cells may be isolated freshly, 
maintained in culture, or reside within an embryo. 

The term "fragment" as used herein to refers to an at least 10, 20, 50, 75, 100, 150, 

25 200, 250, 300, 500, 1000, 2000 or 5000 nucleotide long portion of a nucleic acid (e.g., 
cDNA) that has been constructed artificially {e.g. , by chemical synthesis) or by cleaving a 
natural product into multiple pieces, using restriction endonucleases or mechanical shearing, 
or enzymatically, for example, by PCR or any other polymerizing technique known in the 
art, or expressed in a host cell by recombinant nucleic acid technology known to one of skill 

30 in the art. The term "fragment" as used herein may also refer to an at least 5, 10, 20, 30, 40, 
50, 75, 100, 150, 200, 250, 300, 400, 500, 1000, 2000 or 5000 amino acid portion of a 
polypeptide, which portion is cleaved from a naturally occurring polypeptide by proteolytic 
cleavage by at least one protease, or is a portion of the naturally occurring polypeptide 
synthesized by chemical methods or using recombinant DNA technology (e.g., expressed 

35 
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from a portion of the nucleotide sequence encoding the naturally occurring polypeptide) 
known to one of skill in the art. 

The term "isolated nucleic acid" as used herein refers to a nucleic acid that has been 
removed from other components of the cell containing the nucleic acid or from other 

5 components of chemical/synthetic reaction used to generate the nucleic acid. In specific 
embodiments, the nucleic acid is 50%, 60%, 70%, 80%, 90%, 95%, 99% or 100% pure. 
The "isolated nucleic acid" is neither (a) identical to that of any naturally occurring nucleic 
acid nor (b) identical to that of any fragment of a naturally occurring genomic nucleic acid 
spanning more than three separate genes, and includes DNA, RNA, or derivatives or 

10 variants thereof. The term covers, for example, (a) a DNA which has the sequence of part 
of a naturally occurring genomic molecule but is not flanked by at least one of the coding 
sequences that flank that part of the molecule in the genome of the species in which it 
naturally occurs; (b) a nucleic acid incorporated into a vector or into the genomic nucleic 
acid of a prokaryote or eukaryote in a manner such that the resulting molecule is not 

1 5 identical to any vector or naturally occurring genomic DNA; (c) a separate molecule such as 
a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction (PCR), 
ligase chain reaction (LCR) or chemical synthesis, or a restriction fragment; (d) a 
recombinant nucleotide sequence that is part of a hybrid gene, z.e., a gene encoding a fusion 
protein; and (e) a recombinant nucleotide sequence that is part of a hybrid sequence that is 

20 not naturally occurring. The techniques used to isolate and characterize the nucleic acids 
and proteins of the present invention are well known to those of skill in the art and standard 
molecular biology and biochemical manuals may be consulted to select suitable protocols 
without undue experimentation. See, e.g., Sambrook et al, Molecular Cloning: A 
Laboratory Manual, 3rd ed., Cold Spring Harbor Press (2001); the content of which is 

25 herein incorporated by reference in its entirety. 

By the use of the term "enriched" in reference to nucleic acid it is meant that the 
specific DNA or RNA sequence constitutes a significantly higher fraction of the total DNA 
or RNA present in the cells or solution of interest than in normal or diseased cells or in the 
cells from which the sequence was taken. Enriched does not imply that there are no other 

30 DNA or RNA sequences present, just that the relative amount of the sequence of interest 
has been significantly increased, for example, by 1 fold, 2 fold, 5 fold, 10 fold, 50 fold, 100 
fold, 500 fold, 1000 fold, 10,000 fold, 100,000 fold or 1,000,000 fold. The other DNA may, 
for example, be derived from a yeast or bacterial genome, or a cloning vector, such as a 
plasmid or a viral vector. 

35 
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The term 'transcription regulatory sequences" as used herein refers to nucleotide 
sequences that are associated with a gene nucleic acid sequence and that regulate the 
transcriptional expression of the gene. The 'transcription regulatory sequences" may be 
isolated and incorporated into a vector nucleic acid to enable regulated transcription in 

5 appropriate cells of portions of the vector DNA. Exemplary transcription regulatory 
sequences include enhancer elements, hormone response elements, steroid response 
elements, negative regulatory elements, and the like. The 'transcription regulatory 
sequences" may be isolated and incorporated into a vector nucleic acid to enable regulated 
transcription in appropriate cells of portions of the vector DNA. The 'transcription 

1 0 regulatory sequence" may precede, but is not limited to, the region of a nucleic acid 
sequence that is in the region 5* of the end of a protein coding sequence that may be 
transcribed into mRNA. Transcriptional regulatory sequences may also be located within a 
protein coding region, in regions of a gene that are identified as "intron" regions, or may be 
in regions of nucleic acid sequence that are in the region of nucleic acid. 

1 5 The term "promoter" as used herein refers to the DNA sequence that determines the 

site of transcription initiation by an RNA polymerase. A "promoter-proximal element 5 ' may 
be a regulatory sequence within about 200 base pairs of the transcription start site. A 
"magnum-specific" promoter, as used herein, is a promoter that is primarily or exclusively 
active in the tubular gland cells of the avian magnum. Useful promoters also include 

20 exogenously- inducible promoters. These are promoters that can be "turned on" in response 
to an exogenously supplied agent or stimulus, which is generally not an endogenous 
metabolite or cytokine. Examples include an antibiotic-inducible promoter, such as a 
tetracycline-inducible promoter, a heat-inducible promoter, a light-inducible promoter, or a 
laser inducible promoter. (See, e.g., Halloran et al 9 2000, Development 127: 1953-1960; 

25 Gemer et al, 2000, Int. J. Hyperthermia 16: 171-81; Rang and Will, 2000, NucleicAcids 
Res.28: 1120-5; Hagihara et al, 1999, Cell Transplant 8: 4314; Huang etal., 1999,M>/. 
Med. 5: 129-37; Forster et al, 1999, Nucleic Acids Res. 27: 708-10; Liu et al, 1998, 
Biotechniques 24: 624-8, 630-2; the contents of which have been incorporated herein by 
reference in their entireties). 

30 To facilitate manipulation and handling of the nucleic acid to be administered, the 

nucleic acid is preferably inserted into a cassette where it is operably linked to a promoter. 
The promoter should be capable of driving expression in the desired cells. The selection of 
appropriate promoters can be readily accomplished. For some applications, a high 
expression promoter is preferred such as the cytomegalovirus (CMV) promoter. Other 

35 promoters useful in the present invention include the Rous Sarcoma Virus (RSV) promoter 
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(Davis et al, 1993, Hum. Gene Therap. 4:151). In other embodiments, all or a portion of 
the, for example, lysozyme, ovomucoid, albumin, conalbumin or ovotransferrin promoters, 
which direct expression of proteins present in egg white, are used, as detailed infra, or 
synthetic promoters such as the MDOT promoter described infra. 

5 The term "expressed" or "expression" as used herein refers to the transcription from 

a gene to give an RNA nucleic acid molecule complementary at least in part to a region of 
one of the two nucleic acid strands of the gene. The term "expressed" or "expression" as 
used herein also refers to the translation from said RNA nucleic acid molecule to give a 
protein or polypeptide or a portion thereof. 

10 The term "matrix attachment regions" as used herein refers to DNA sequences 

having an affinity or intrinsic binding ability for the nuclear scaffold or matrix. The MAR 
elements of the chicken lysozyme locus were described by Phi- Van et al 9 1996, EM.B.O. J. 
76:665-664 and Phi-Van, L. and Stratling, W.H., 1996, Biochem. 35:10735-10742; 
incorporated herein by reference in their entireties. 

15 The term "nucleic acid vector" as used herein refers to a natural or synthetic single 

or double stranded plasmid or viral nucleic acid molecule, or any other nucleic acid 
molecule, such as but not limited to YACs, BACs, bacteriophage-derived artificial 
chromosome (BBPAC), cosmid or PI derived artificial chromosome (PAC), that can be 
transfected or transformed into cells and replicate independently of, or within, the host cell 

20 genome. A circular double stranded vector can be linearized by treatment with an 

appropriate restriction enzyme based on the nucleotide sequence of the vector. A nucleic 
acid can be inserted into a vector by cutting the vector with restriction enzymes and ligating 
the pieces together. The nucleic acid molecule can be RNA or DNA. 

The term "expression vector" as used herein refers to a nucleic acid vector that 

25 comprises regulatory sequences operably linked to a nucleotide sequence coding for at least 
one polypeptide. As used herein, the term "regulatory sequences" includes promoters, 
enhancers, and other elements that may control expression. Standard molecular biology 
textbooks such as Sambrook et al , (supra) and Lodish et al , eds "Molecular Cell Biology" 
Freeman (2000) and incorporated herein by reference in their entireties, may be consulted to 

30 design suitable expression vectors, promoters, and other expression control elements. It 
should be recognized, however, that the choice of a suitable expression vector depends upon 
multiple factors including the choice of the host cell to be transformed and/or the type of 
protein to be expressed. Also useful for various applications are tissue-selective (i.e., tissue- 
specific) promoters, Le. 9 promoters from which expression occurs preferentially in cells or a 

35 particular kind of tissue, compared to one or more other types of tissue. For example, 
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15 



chicken oviduct-specific promoters naturally associated with the proteins of avian egg 
whites including, but not limited to, lysozyme, ovomucoid, albumin, conalbumin, and 
ovotransferrinmaybeused. 

The term "recombinant cell" refers to a cell mat has a new combination of nucleic 
5 acid segments that are not covalently linked to each other in nature in that parhcular 
configuration. A new combination of nucleic acid segments can be introduced into an 
organismusmgawidea^^ . 
cell oramammaliancell. Tbe recombinant cell may harbor a vector that is extragenomic 
10 tof**^^*^^*™**^*"™ Arecombinant 
cell can further harbor a vector or a portion thereof (e.g. , the portion containing the 
regulatory sequences and the coding sequence) that is intragenomic. The term 
„omic«^^ 

8en0me The terms "recombinant nucleic acid" and "recombinant DNA" as used herein refer 
to a combination of at least two nucleic acid sequences that is not naturally found in a 
eukaryotic or prokaryotic cell in that particular configuration. The nucleic acid sequences 
may include, but are not limited to, nucleic acid vectors, gene expression regulatory 

elements, origins of ,4^*^^^*^^*^^*^ 
20 andprotein-encodingsequences. The term "recombinant polypeptide" is meant to include a 

polypeptide produced by recombinant DNA techniques such that it is distinct from a 
naturally occurring polypeptide eimer in its location, purity or structme. Generally, such a 
recombinant polypeptide will be present m a ceU m an amount afferent from that normally 
observed in nature. 

25 As used herein, the term "transgene" refers to a nucleic acid sequence (encoding, for 

example, ahuman interferon polypeptide) that is partly or entirely heterologous, i.e., 
foreign, to the transgenic animal or cell into which it is introduced, or, is homologous to an 
endogenous gene of the transgenic animal or cell intowhichit is introduced, but which is 
designed to be inserted, or is inserted, into the M^^mA^mm^^ 
30 genome of the cell into which it is inserted (e.g., it is inserted at a location which differs 
from that of the natural gene or its insertion results in a knockout). A transgene also 
includes aregulatory sequence designed to be inserted into the genome such that it regulates 
the expression of an endogenous coding sequence, e.g., to increase expression and or to 
change the timing and or tissue specificity of expression, etc. (e.g., to effect "gene 
35 activation"). 
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The term "transgenic animal" as used herein refers to an animal, including an avian 
speciessnchasacMcken.mwMchoneormoreofmeccllsofmea^conta^ 

heterologous nucleic acid introduced by way of human intervention, such as by transgemc 

tecnmquesweUknow™^^ +r . . 

5 As used herein, a "transgenic avian" is any avian species, including but not hmited 

to chicken, turkey, duck, goose, quail, pheasants, parrots, finches, hawks, crows and ratites 
including ostrich, emu and cassowary, in which one or more oftheceUs oftheav.au may 
contain heterologous nucleic acid introduced by way of human intervention, such as by 
transgenic techniques known in the art, and particularly, a, described hercm. Tlie nuclei 
1 0 acid is introduced into a cell, directly or indirectly by introduction into a precursor of me 
cell by way of deliberate genetic manipulation, such as by microinjection or by infection 
with a recombinant virus. The term genetic manipulation does not include classical cross- 
breeding, or in vitro fertilization (although it does include fertilization with sperm into 
which a transgene has been introduced, but rather is directed to the introduction of a 
15 recombinant DNA molecule. This molecule may be integrated within a chromosome, or it 
m ay be extrachromosomally replicating DNA. In the typical transgenic avian, the transgene 
causes cells to express a recombinant form of the subject polypeptide, e.g. either agoiustic 
orantagonisticforms,oraforminwhichthegenehasbeendisrupted. 

The terms "chimeric animal" or "mosaic animal" are used herein to refer to annuals 
20 in which the recombinant gene is found, or in which_the recombinant is expressed m some 
but not all cells of the animal. The term "tissue-specific chimeric animal" indicates that the 
polypeptide encoding gene is present and expressed in some tissues, but not others. 

The term "knock-in animal" refers to an animal that carries a specific nuclexc acid 
sequence such as a "knock-in sequence" in a predetermined coding or noncoding region, 
25 wherein the knock-in sequence is introduced through methods of recombination, such as 
homologous recombination. The recombination event comprises replacing all or part of a 
gene of the animal by a functional homologous gene or gene segment of another animal, 
where the respective knock-in sequence is placed in the genomic sequence. 

The term "chromosomal positional effect (CPE)" as used herein refers to the 
30 variation in the degree of gene transcription as a function of the location of the transcribed 
locus within the cell genome. Random transgenesis may result in a transgene being mserted 
at different locations in the genome so that individual cells of a population of transgemc 
cells may each have at least one transgene, each at a different location and therefore each in 
a different genetic environment Each cell, therefore, may express the transgene at a level 
35 specific for that particular cell and dependant upon the immediate genetic environment of 
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the transgene. In a transgenic avian, as a consequence, different tissues may exhibit 
different levels of transgene expression. 

The term "cytokine" as used herein refers to any secreted polypeptide that affects the 

functoofceUsandisamol^ , a 

5 immune, inflammatory or hematopoietic response. A cytokine includes, but is not hrmted 
to monokines and lymphokines regardless of which cells produce them. For instance a 
monokine is generally referred to as being produced and secreted by a mononuclear ceU, 
such as a macrophage and/or monocyte. Many other cells however also produce monokmes, 
such as natural killer cells, fibroblasts, basophils, neutrophils, endothelial cells, brain 
10 astrocytes, bone maxrow stromal 

Lymphokines are generally referred to as being produced by lymphocyte cells. Examples of 
cytokines include, but are not limited to, Interleukin-1 (IL-1), Merleukin-6 (IL-6), 
Interleukin-8 (IL-8), Tumor Necrosis Factor-alpha (TOF-alpha) and Tumor Necrosis Factor 

1 5 The term "antibody" as used herein refers to polyclonal and monoclonal antibodies 

and fragments thereof, and immunologic binding equivalents thereof. The term "antibody- 
refers to a homogeneous molecular entity, or a mixture such as a polyclonal serum product 
made up of a plurality of different molecular entities, and may further compnse any 
modified or derivatised variant thereofthat retains the ability to specifically bind an epitope. 

. , w e «f^wrivelvhmdine to a target antigen or epitope. 

a monoclonal anaoouy ia wopa^c «x ^ ., - ~ =• 

Antibodies may include, but are not limited to polyclonal antibodies, monoclonal antibodies 
(mAbs) humanized or chimeric antibodies, single chain antibodies, Fab fragments, F(ab ) 2 
fragments, disulfide-linked Fvs (sdFv) fragments produced by a Fab expression library, anti- 
idiotypic (anti-Id) antibodies, intrabodies, synthetic antibodies, and epitope-bmdmg 

25 fragments of any of the above. 

The term "immunoglobulin polypeptide" as used herein refers to a polypeptide 
derived from a constituent polypeptide of an immunoglobulin. An "immunoglobulin 
polypeptide" may be, but is not limited to, an immunoglobulin (preferably an antibody) 
heavy or light chain and may include a variable region, a diversity region, joining region and 

30 a constant region or any combination, variant or truncated form thereof. The term 

"immunoglobulin polypeptide" further includes single-chain antibodies comprised of, but 
not limited to, an immunoglobulin heavy chain variable region, an immunoglobulin light 
chain variable region and optionally a peptide linker. 

The term "origin of replication" (orQ as used herein refers to unique regions of a 

35 nucleic acid sequence containing multiple short repeated sequences, recognized by 
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multimeric origin of replication binding proteins that organize the assembly of multiple 
enzymes and proteins required for the replication of the nucleic acid. The origin of 
replication derived from K coli may be included in a plasmid for replication of the plasmid 
in a bacterial host. The SV40 viral ori is a 65 bp region derived from the SV40 viral 
5 chromosome that when included in a nucleic acid sequence will allow replication of the 
nucleic acid in an animal cell. Inclusion of the SV40 ori region in a plasmid that also has 
the E. coli ori element will allow the plasmid to be replicated in both a bacterial host and in 
an animal cell. 

The term "centromere" as used herein refers to a small, specialized region of a 
10 chromosome recognized as a constriction in a condensed chromosome. A kinetochore lies 
within the centromeric region and is attached to microtubules extending to the poles of a 
dividing cell. 

The term "telomere" as used herein refers to repetitive oligomeric nucleic acid 
sequences located at the ends of linear eukaryotic chromosomes. Telomeres are required to 
15 prevent shortening of chromosomal DNA during replication of the linear nucleic acid. 

Recombinant expression vectors can be designed for the expression of the encoded 
proteins eukaryotic cells. Useful vectors may comprise constitutive or inducible promoters 
to direct expression of either fusion or non-fusion proteins. With fusion vectors, a number 
of amino acids are usually added to the expressed target gene sequence such as, but not 
20 limited to, a protein sequence for thioredoxin. A proteolytic cleavage site may further be 
introduced at a site between the target recombinant protein and the fusion sequence. 
Additionally, a region of amino acids, such as a polymeric histidine region, may be 
introduced to allow binding of the fusion protein to metallic ions such as nickel bonded to a 
solid support, and thereby allow purification of the fusion protein. Once the fusion protein 
25 has been purified, the cleavage site allows the target recombinant protein to be separated 
from the fusion sequence. Enzymes suitable for use in cleaving the proteolytic cleavage site 
include, but are not nmited to, Factor Xa and thrombin. Fusion expression vectors that may 
be useful in the present invention include pGex (Amrad Corp., Melbourne, Australia), 
pRIT5 (Pharmacia, Piscataway, NJ) and pMAL (New England Biolabs, Beverly, MA), lhat 
30 fose glutathione S-transferase, protein A, or maltose E binding protein, respectively, to the 
target recombinant protein. 

Expression of a foreign gene can be obtained using eukaryotic host cells such as, but 
not limited to, mammalian or avian cells. The use of eukaryotic host cells permit partial or 
complete post-translational modification such as, but not only, glycosylation and/or the 
35 formation of the relevant inter- or intra-chain disulfide bonds. Examples of vectors useful 
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for expression in the chicken GaUvs ga/ta include pYepSecl as in Baldari « at, 

E m7o .J. 6, 229-34 (1987) and pYES2 (hrvhrogen Corp., San Diego, CA), rocorporoted 

Lm by reference in tneir enurefie, Once Ore isolated DNA molecule of * pr«ent 

i^Ihaabeenelonedh^m^onsy^Hb^tobenKxnpo^n^a 

i^ganudeicacidimoacell. Marry teehniqnes are well known to drose skdled rn the 
arttoMUtaten^fonnadonorn^ecdonofanncleieacidintoaprokaryoncor 

elecurc field, decent, or liposome medial transfecdon, • render tire host ee l ™rpe«n, 
forttenp^eofthenneleieaeidmolecnlea.andby^me^asspernr-nredu^dand 

restriction-mediated integration. ttmra ju m A 
The «enn Wecung agen," as used herein refers to a composdron of matter added 

Examples of ttanafecfing agents include adenovirus-ttansferrm-polylyame-DNA «,mplexes. 
20 brealotow.duringtapassagethroughmecytoplasm.onrenucte These 
byleptorsonmeceUsurfi^ofthegemrcefi.suehasmec-ldthgandmmodmeattona 

^ Other preferred ttansfecmrg agents include, but are no, limited to, 
25 hpfectamhre, DIMRIE C, Supeffeet, and Effecfin (Qiagen), unifecfin, maxrfecttn, DOTMA, 
DOGS (Transfectam; dioctadecylamidoglycylspermine), DOPE (l,2-dioleoyl-sn^lycero-3- 
phosphoemanolanune), DOTAP a^oleoyl.3-himemylammoninm propane), DDAB 
(dimethyl diocttdecytammoninm bromide), DHDEAB (N.N-m-n-hexadecyl-N.N- 
dihydroxyedryl ammonium bromide), HDEAB (N-n-hexadecylN,N- 
30 dihydroxyethylammonium bromide), polybrone, or poly(emylenimtoe) (PEI). These 
njviral agents have fine advantage that they can facilitate smble integral of xenogeneic 
DN A fences into the vertebrate genome, without size resttiefiona common!, assocrated 
with virus-derived transfecting agents. 

The terms ^cytoplasmic sperm injection" and «ICSF as used herem refer to 
35 delivering an exogenous nucleic acid to a recipient cell by associating the exogenous 



•20 



WO 03/024199 



PCT/US02/30156 



nucleic acid with the head of a sperm cell and then delivering the sperm cell head to the 
recipient cell by microinjection. The exogenous nucleic acid may be integrated into the 
endogenous genomic nucleic acid of the sperm, non-integrated as an episomal element of 
the nucleic acid complement of the sperm head, or linked internally or externally to the head 
5 of the sperm. The terms "chlCSI" and "CfflCSI™" as used herein refer to intracytoplasmic 
sperm injection into a chicken cell. 

The terms "sub-zonal injection" and "SUZT refer to delivering viable spermatozoa 
to an oocyte by microinjection, wherein the sperm are microinjected between the zona 
pellucida and the cytoplasmic membrane of an oocyte. 
1 0 The term "gene delivery (or transfection) mixture" as used herein, in the context of 

the methods of sperm mediated transfer described herein, refers to selected genetic material 
in an appropriate vector mixed, for example, with an effective amount of lipid transfecting 
agent, for example, a cationic or polycationic lipid, such as polybrene. The amount of each 
component of the mixture is chosen so that the genetic modification, e.g., by transfection or 
1 5 transduction, of a specific species of male germ cell is optimized. Such optimization 
requires no more than routine experimentation. The ratio of DNA to lipid is broad, 
preferably about 1:1, although other proportions can also be utilized depending on the type 
of lipid transfecting agent used. 

This application uses gene nomenclature accepted by the Cucurbit Genetics 
20 Cooperative as it appears in the Cucurbit Genetics Cooperative Report 18:85 (1995); herein 
incorporated by reference in its entirety. Using this gene nomenclature, genes are 
symbolized by italicized Roman letters. If a mutant gene is recessive to the normal type, 
then the symbol and name of the mutant gene appear in italicized lower case letters. 

25 3.2 ABBREVIATIONS 

Abbreviations used in the present specification include the following: aa, amino 
acid(s); bp, base pair(s); cDNA, DNA complementary to RNA; mRNA, messenger RNA; 
tRNA, transfer RNA; nt, nucleotide^); SSC, sodium chloride-sodium citrate; MAR, matrix 
attachment region; DMSO, dimethyl sulfoxide; TPLSM, two photon laser scanning , 
30 microscopy; REMI, restriction enzyme mediated integration; WEFs, whole embryo 
fibroblasts. 

4. BRIEF DESCRIPTION OF THE FIGURES 

FIGS. 1 A-E illustrate the nucleotide sequence (SEQ ID NO: 6) comprising the 
35 chicken lysozyme gene expression control region (SEQ ID NO: 7), the nucleotide sequence 
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encoding the chicken expression optimized human interferon a2b (IFNMAGMAX; SEQ LD 
NO: 5) and a SV40 polyadenylation signal sequence (SEQ ID NO: 8). 

FIG. 2 illustrates the nucleotide sequence SEQ ID NO: 5 encoding the chicken 
5 expression optimized human interferon a2b (IFNMAGMAX). 

PIGS. 3 A-E illustrate the nucleotide sequence SEQ ID NO: 7 encoding the chicken 
lysozyme gene expression control region. 

10 FIG. 4 illustrates the nucleotide sequence SEQ ID NO: 8 encoding the SV40 

polyadenylation signal sequence. 

FIGS. 5A-C illustrate the nucleotide sequence SEQ ID NO: 9 encoding the chicken 
lysozyme 3' domain. 

15 J# . 

FIGS 6A-J illustrate the nucleotide sequence SEQ ID NO: 10 encoding the 
lysozyme gene expression control region (SEQ ID NO: 7) linked to the insert having the 
nucleotide sequence of SEQ ID NO: 5 encoding the chicken expression-optimized human 
interferon a2b (IFNMAGMAX) and the chicken lysozyme 3' domain SEQ ID NO: 9. 

FIG. 7 illustrates the nucleotide sequence SEQ ID NO: 11 of the combinatorial 
promoter MDOT. 

FIGS 8A-B illustrate the oligonucleotides and primers (SEQ ID NOS: 14-31) used 
25 in the formation of the chicken codon optimized human interferon a2b-encoding nucleic 
acid. 

FIG. 9 illustrates the primers (SEQ ID NOS: 32-35) used in the synthesis of the 
MDOT promoter. 

30 FIG. 10 illustrates the level of human monoclonal antibodies IgG expressed in the 

serum of transgenic chick using ELISA. 

FIG. 1 1 illustrates the detection of EGFP positive bands from transgenic sperm. 

35 
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5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to methods of introducing nucleic acids into avian 
oocytes by sperm-mediated transfection to produce a transgenic chicken or quail, or other 
avian species, carrying the transgene in the genetic material in all or most of its tissue, 

5 including germ-line tissue. The methods and vectors of the present invention further 

generate transgenic avians that express heterologous genes in the serum of the avian and/or 
are deposited into an avian egg, preferably in the egg white. Vectors containing promoters 
that direct high level of expression of the heterologous protein in the avian, particularly in 
the magnum for deposition into the avian egg are provided. Additional regulatory elements, 

10 such as MARs, IRES's, enhancers, polyadenlyation signals, etc., may be included in the 
vectors of the invention to improve expression and efficiency. 

5.1 METHODS QE TRANSGENESIS 
5.1.1 SPERM-MEDIATED INTEGRATION OF HETEROLOGOUS 
15 TRANSGENES 

The transgenic avians of the present invention are most preferably generated using 
sperm-mediated transfection of nucleic acid into avian oocytes. Specifically, the present 
invention provides methods for introducing nucleic acids containing a transgene, preferably, 
a nucleic acid vector of the invention as described in Section 5.2, infra, into an avian oocyte 

20 by sperm-mediated transfection. In preferred embodiments, the nucleic acid is first 
introduced into an avian sperm in vitro by lipofection, electroporation, restriction enzyme 
mediated integration (REMI) or similar methods, or in vivo by microinjection into the testis, 
and the modified avian sperm is then delivered to an avian oocyte by natural coitus after the 
modified avian sperm are returned to the testis of a male avian or in the method in which the 

25 nucleic acid has been injected directly into the testis or in vitro by microinjection, 

intracytoplasmicosperm injection (ICSI) or artificial insemination of oocytes isolated from 
an ovulating female bird, thereby generating a transgenic 2ygote and chick. In certain 
embodiments, the male germ cells are irradiated, more preferably irradiated by gamma rays, 
before the heterologous nucleic acid is incorporated therein. In other embodiments, the 

30 testis is depopulated of sperm prior to introduction of the transfected sperm. 

The present invention contemplates that any technique capable of transferring 
heterologous material into sperm could be used so long as the technique preserves enough 
of the sperm's fertilization functions, such that the resultant sperm will be able to fertilize 
the oocyte. It is understood that the heterologous nucleic acid may be integrated into the 

35 genome of a recipient cell such as a spermatogonial cell or a spermatogonial precursor cell 
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for subsequent transfer to an embryo or the testicular material of the recipient male animal, 
preferably a chicken. It is further understood that the heterologous nucleic acid may not be 
integrated into the genome of the recipient cell but delivered as an episome which may or 
may not be integrated into the genome of the resulting zygote or chick. 

5 

5.1.1.1 PREPARATION OF TRANSGENIC CONSTRUCT 

One aspect of the present invention relates to the preparation of a transgene which is 
to be incorporated into the genome of an avian sperm. In certain embodiments, the 
transgene comprises at least one heterologous nucleic acid. It is contemplated to be within 
1 0 the scope of the present invention for the heterologous nucleic acid to comprise an 

expression vector such as, but not limited to, viral vectors, plasmid vectors, or linearized 
nucleic acid vectors or a combination thereof. (See section 5.2, infra, for details on vectors, 
and the preparation thereof). The expression vector may particularly be any suitable 
nonviral vector including plasmid DNA, bacteria artificial chromosomes (BACs), yeast 
1 5 artificial chromosomes (YACs), etc. The expression vector may also be any suitable viral 
vector, for example, retroviral vectors, adenoviral vectors, transferrin-polylysine enhanced 
adenoviral vectors, human immunodeficiency virus vectors, lentiviral vectors, Moloney 
murine leukemia virus-derived vectors, and virus-derived DNAs that facilitate 
polynucleotide uptake by and release into the cytoplasm of germs cells. 
20 Transcriptional promoters of an expression vector of the present i nvention may be a 

constitutively active promoter such as the cytomegaloviral promoter or Rous sarcoma virus 
promoter, or a tissue-specific promoter, preferably a tissue-specific promoter operable in 
oviduct cells of an avian species including, but not limited to, the promoters of the genes 
encoding ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and ovomucin. 
25 Optionally, the transcriptional promoter of an expression vector may be a regulatable 
promoter. The expression vector may further comprise a region encoding a transcriptional 
terminator, such as a bovine growth hormone transcriptional terminator. 

In preferred embodiments, a transgene construct comprises at least two separate or 
independent elements. A first element could comprise an oviduct-specific promoter, such 
30 as, but not limited to ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and 
ovomucin, which would drive expression of a gene coding for a protein of interest in the 
oviduct. A second element can be located either upstream or downstream for the first 
element and comprises a protamine promoter, or a segment thereof that is sufficient to drive 
the expression of a marker gene such as the Green Fluorescent Protein (GFP) to facilitate 
35 identification of transfected sperm. 
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In one embodiment of the present invention, the heterologous nucleic acid comprises 
cohesive ends characterized as capable of hybridizing to cohesive ends generated by a 
restriction endonuclease. The cohesive ends on the nucleic acid may be generated by 
restriction endonuclease cleavage of a circular or linear nucleic acid, by the chemical 
5 addition of nucleotides to the ends of a linear nucleic acid, or by a combination of chemical 
and enzymatic methods. 

In another embodiment of the present invention, the heterologous nucleic acid is 
linearized and has at least one blunt end. The blunt end of the nucleic acid may be 
generated, by an exonuclease digestion of cohesive ends, such as SI nuclease. 
10 In the methods of generating transgenic cells according to the present invention, the 

genomic nucleic acid of the recipient cell, male germ cell or oocyte can be cleaved to 
receive the integrating heterologous nucleic acid. Any method may be selected that will 
generate limited, random cleavage that will allow integration of the heterologous nucleic 
acid into the genome of the recipient cell or oocyte. When the integrating heterologous 
1 5 nucleic acid has cohesive ends, the recipient genomic nucleic acid may be cleaved with a 
restriction endonuclease generating cohesive ends capable of hybridizing to the cohesive 
ends of the heterologous nucleic end. When the heterologous nucleic acid has blunt ends, 
the genomic nucleic acid can be cleaved by any method that will generate blunt ends at the 
cleavage site, including restriction endonuclease cleavage, or irradiation of the cell with 
20 high-energy irradiation. Suitable radiations that may be applied to the methods of the 
present invention include, for example, gamma rays, x-rays, ultraviolet light or ultrasound. 
It is contemplated that the cleavage of genomic nucleic acid and integration of a 
heterologous nucleic acid therein will result in a viable recipient cell that can be used to 
fertilize an avian oocyte, or will not yield a viable cell. A non-viable sperm cell may, 
25 however, be used to deliver the transgene to an oocyte using, for example, the ICSI 
(CfflCSI™) method. 

The heterologous nucleic acid of the present invention may further comprise a 
centromere element and at least one telomere element. In one embodiment, the centromere 
and the at least one telomeres are derived from the chicken. While the ori site alone will 
30 allow replication of the heterologous nucleic acid when transfected into an oocyte or zygote 
thereof, segregation of the replicates into each daughter cell will require the optional 
centromeric element. In the absence of this centromeric element, segregation will be 
random between daughter cells with some daughter cells not receiving one copy of the 
transgenic nucleic acid. A mosaic transgenic animal would, therefore, result. 



-25- 



WO 03/024199 



PCT/US02/30156 



In one embodiment of the present invention, therefore, the heterologous nucleic acid 
is an artificial chromosome comprising a heterologous transgenic element having the 
properties desired to be expressed by a transgenic animal, an origin of replication (ori) site, 
and a centromere. In this embodiment, the heterologous nucleic acid may be a circular 

5 nucleic acid or a linear nucleic acid In another embodiment, the heterologous nucleic acid 
is a linear nucleic acid further comprising telomeres. 

In another aspect of the methods according to the present invention, the transgenic 
oocyte or ovum of the present invention is incubated for development of the zygote therein 
to a fetus, and subsequently to a chick for hatching. In one embodiment of the present 

10 invention, therefore, the zygote is incubated in a surrogate avian female, wherein the 

method comprises the steps of fistulating an avian female, delivering the avian oocyte to the 
infundibulum of the female bird, allowing the avian female to incubate the avian oocyte to 
an embryo within an egg, allowing the avian female to lay the egg, and allowing the embryo 
to hatch as a viable chick, wherein the chick is a transgenic chick having an exogenous 

15 nucleic acid incorporated therein. 

5.1.1.2 SPERM TRANSGENESIS 

The heterologous nucleic acid may be delivered to an avian male germ cell (Le., 
sperm, spermatozoon cell or a precursor cell) by a method such as by contacting the male 

20 germ cell with a gene delivery mixture comprising a nucleic acid, either a eukaryotic viral 
vector or a vector that is not derived from a eukaryotic virus, at about or below the avian's 
body temperature and for an effective period of time such that the nucleic acid is 
incorporated into the cell, and preferably into the genome of the cell, optionally isolating or 
selecting the genetically modified cell with the aid of a genetic selection marker expressed 

25 in the genetically modified cell, transferring the isolated or selected genetically modified 
germ cell to a testis of a recipient male avian such that the cell lodges in a seminiferous 
tubule of the testis. A genetically modified male gamete may be produced therein, and 
breeding the recipient male avian with a female avian of its species will generate transgenic 
progeny that carry the heterologous transgenic nucleic acid in its genome. 

30 In certain embodiments, the avian male germ cells are isolated and removed from a 

male avian. The avian male germ cells is then transfected by introducing the heterologous 
nucleic acid into the genome of the avian male germ cells by lipofection, electroporation, 
restriction enzyme mediated infection (REM) or similar methods. In certain other 
embodiments, the heterologous nucleic acid is injected directly into the testis of the male 

35 avian for transfection. Male germ cells can be extracted to determine whether transfection 
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has occurred or the extent of transfection. The male avian can be mated with a female avian 
to produce transgenic offsprings or the sperm can be used for IVF. 

The precursor cell may be selected from the group consisting of spermatogonial 
stem cells, type B spermatogonia, primary spermatocytes, preleptotene spermatocytes, 
5 leptotene spermatocytes, zygotene spermatocytes, pachytene spermatocytes, secondary 
spermatocytes, and spermatids. The embodiment further comprises the steps of 
incorporating the heterologous transgene into the genome of the spermatozoon cell or the 
precursor cell, so that a genetically modified male gamete is produced by the male avian, 
and breeding the male avian with a female of the same species such mat a transgenic 
10 progeny is thereby produced that carries the polynucleotide in its genome. 

In certain embodiments, the heterologous genetic material may be introduced into 
the genome of an avian male germ cell, such that a polynucleotide is delivered using known 
gene delivery systems to male germ cells in situ in the testis of the male avian (e.g., by in 
vivo transfection or transduction). In one embodiment, the invention relates to an in vitro 
1 5 method of incorporating heterologous genetic material into the genome of a male avian by 
isolating male germ cells ex corpora, delivering a polynucleotide thereto, and then returning 
the transfected cells to the testes of a recipient male bird. In yet another embodiment, the in 
vitro method involves microinjecting the recombinant male germ cells into a recipient 
fertilized oocyte, whereupon the sperm head enters the oocyte nucleus to deliver the 
20 heterologous nucleic acid thereto. 

In a preferred embodiment, the invention relates to an in vivo method that injects a 
gene delivery mixture, preferably into the seminiferous tubules, or into the testis, and most 
preferably into the vas efferens or vasa efferentia using, for example, a micropipette and a 
picopump delivering a precise measured volume under controlled amounts of pressure. The 
25 modified germ cells differentiate in their own milieu. Progeny animals exhibiting the 
nucleic acid's integration into its germ cells (i.e., transgenic animals) are selected. The 
selected progeny can then be mated, or their sperm utilized for insemination or in vitro 
fertilization, to produce further generations of transgenic progeny or for microinjection into 
isolated oocytes. 

30 In another preferred embodiment, the invention relates to an in vitro method wherein 

male germ cells are obtained or collected from a donor male avian, by any means known in 
the art such as, for example, transection of the testes. The male germ cells are then exposed 
to a gene delivery mixture, preferably within several hours of collection, or cryopreserved 
for later use. When the male germ cells are obtained from the donor avian by transection of 

35 the testes, the cells can be incubated in an enzyme mixture known for gently breaking up the 
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tissue include, but are no, .united to, pancreauc coUagenase type I, P-creahc 

DNAsetypel, as well as bovine serum atoumin andamodffiedDr^Mmedrum. Ate 
^t^^eenbepiacedinenn^rned^snobaaDM^o^ 
andp 1 lonacnln^d i sbforgenefic m odmeadonbyexposu re «oagenedebveryanxnae. 

to otber embodiment a transgene ean be incorporated into an avran sperm by 
Upofeetion, electroporaSon, restriction enzyme mediated integration (REMI), 
tatraeytoplasmio sperm injectton (ICS!) or similar methods. 



In a preferred embodiment, a transgene is incorporated into an avian sperm by 
Uposomes. The male genn cells, which may be hhao. and viable spermatozoa or the non- 
viable heads thereofimay be transfer to a recipient oocyte using Uposomo-medrated 
delivery PCT Publicadou WO 87/05325, which is incorporated by reference hetem m «s 
,5 enur^y.'discloses a method of transferring organic and/o, inorganic materia! into sp«m or 
gglsbyusingfiposomes.Tbehete.logousnuc.eioaddeanalsobemco^dmtoa 

Reprol Develop. 30: 194-200; Nakanishi rfW WB. ««"• 36 ' 258 " 

261). 

20 

of exogenous DNA fragments by cultured cells. Enhancement of nuclear uptake of toe 
25 hetenjogous DNA win promote eariier chromosomal integration of toe exogeoousDNA 
molecules, tons reducing me degree of generic mosaicism observed » riansgemo avan 

one embodiment toe male geun cells is placed in a cuvette and a solution of the 
— c nucleic acid coding toe protem of interest is added. A direct current pulse . 
30 dischLged in toe cuveue suspension. The current pulse creates temporary, short-hved pores 
in toe cell membrane and allow toe male germ cells to take up toe transgene wmle only 
slightly compromising cell viability. More description on toe use of election to 
incorporate DNA can be found in Gagne e, al, 1991, Mot. Reprod Develop. 29: 6-15, 
which is incorporated herein by reference in its entirety. 



35 
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Restriction Enzyme Mediated Integration (REMI) 

In yet another preferred embodiment, a transgene is incorporated into an avian sperm 
by restriction enzyme mediated integration (REMI). The heterologous nucleic acid to be 
integrated into, for example, the sperm nuclear DNA is converted to a linear double 
5 stranded DNA possessing single-stranded cohesive ends by contacting the heterologous 
DNA with a type n restriction enzyme that upon scission, generates such ends. The nucleic 
acid to be cut can be a circular nucleic acid such as in a piasmid or a viral vector or a linear 
nucleic acid that possesses at least one recognition and cutting site outside of the genes or 
regulatory regions critical to the desired post-integration function of the nucleic acid, and no 
1 0 recognition and cutting sites within the critical regions. 

Alternatively the heterologous DNA to be integrated into the sperm nuclear DNA 
can be prepared by chemically and/or enzymatically adding cohesive ends to a linear DNA. 
The added cohesive ends must be able to hybridize to the cohesive ends characteristic of a 
nucleic acid cleaved by a type II restriction endonuclease. Alternatively, the cohesive ends 
15 can be added by combining the methods based on type D restriction enzyme cutting and 
chemical and/or enzymatic addition. It is also within the scope of the present invention for 
- the linearized nucleic acid to have one end that is a blunt end without unpaired nucleotides. 
Such blunt ends can be generated by restriction endonuclease digestion, exonuclease 
digestion of cohesive ends or fill-in of cohesive ends by polynucleotide synthesis, using 
20 methods as-deseribedy for example, in Sambrook et al, (supra), incorporated herein by 
reference in its entirety. 

It is also to be understood that a nucleic acid to be delivered to a recipient cell may 
be cleaved with two different restriction endonucleases that may generate the same or 
different cohesive termini, or at least one blunt-end terminus. Neither restriction 
25 endonucleases will have a recognition site within the nucleic acid sequence required to be a 
transgene in the recipient cell. 

When a restriction endonuclease is used to cleave the genomic nucleic acid of the 
recipient cell, the endonuclease may be co-delivered to the recipient cell such as a sperm 
cell with the heterologous nucleic acid, or sequentially delivered If a nucleic acid is 
30 cleaved with at least two restriction endonucleases, thereby generating at least one cohesive 
terminus, the at least two endonucleases may be delivered to a recipient cell either together 
or sequentially. The transfected nucleic acid may be mixed with at least one of the 
endonucleases or delivered to a recipient cell before or after at least one endonuclease is 
delivered thereto. 
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At least one terminus of a linearized nucleic acid to be delivered to a recipient cell 
may be a blunt end terminus, generated by endonuclease cleavage, chemical synthesis, 
enzyme directed nucleic acid digestion or synthesis, or any combination thereof. A 
recipient cell genome such as a sperm cell genome, may therefore be cleaved before, during 
5 or after delivery of the linearized nucleic acid to the cell, by delivery of a blunt-end 

generating restriction endonuclease to the recipient cell, or by radiation-induced cleavage. 
Suitable radiations that may be applied to, for example, a sperm cell include, but are not 
limited to, gamma radiation, x-rays, ultraviolet light and ultrasound. The dose and duration 
of the radiation applied to a cell sample are determined for each sample, for levels of 
1 0 cleavage that will allow integration of the transfected nucleic acid into the cell genome, 
while mamtaining viability of the cells for use in artificial insemination or recolonization of 
an avian testes. Viability of a recipient sperm may not be required when the transfected 
sperm are delivered to a recipient avian oocyte by such procedures as ICSI or CHICSI™. 
Cleavage of the genomic nucleic acid by irradiation or ultrasound can be either before, 
1 5 during or after delivery of the heterologous nucleic acid to the recipient cell. 

While not wishing to be bound by any one theory, the transfected nucleic acid may 
be integrated into a cleavage site of the genomic nucleic acid. Integration may be facilitated 
by the cohesive ends on the heterologous nucleic acid that hybridize to the like cohesive 
ends of the cleaved genomic nucleic acid. The integrated heterologous nucleic acid will 
20 then replicate and segregate with the genome of the recipient cell. 

Alternatively, the heterologous nucleic acid may not be integrated into a recipient 
genome, but will remain as an extrachromosomal episome. The heterologous nucleic acid 
of the present invention may circularize by hybridization of the cohesive ends of the nucleic 
acid, rather than be integrated into the genome. When the heterologous nucleic acid 
25 comprises any natural or synthetic origin of replication {ori element) the nucleic acid will be 
capable of replicating independently of the recipient genome. In one embodiment of the 
present invention the ori site included with a heterologous nucleic acid is derived from the 
SV40 virus. Episomal replication and segregation of daughter copies of the episome is 
facilitated by the linearized viral ori site and/or a centromere isolated from, for example, a 
30 chicken chromosome, thereby generating a chicken artificial chromosome. In another 
embodiment, the linearized heterologous nucleic acid will not be integrated into the genome 
of the recipient cell but remain as a separate unit that, because of a centromeric structure 
incorporated therein, will segregate into daughter cells during mitotic division. In this case, 
the unincorporated episomal heterologous nucleic acid is a chicken artificial chromosome 
35 (CAC). 
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The REM! method for stably integrating heterologous DNA into the genomic DNA 
of a recipient cell is described by Shemesh et al in PCT Publication No. WO 99/42569 and 
incorporated herein by reference in its entirety. This REM method comprises in part an 
adaptation of Ibe REMI technique disclosed by Schiest and Petes (Proc. Nat. Acad. Sci. 

5 U.S.A. 88, 7585-7589 (1991)) and Kuspa and Loomis (Proc. Nat. Acad. Sci. U.S.A., 89, 
8803-8807 (1992)) both incorporated herein by reference in their entireties. 

In preferred embodiments, the avian sperm are irradiated before being exposed to 
gene delivery mixture or having a transgene incorporated therein. The male germ cells can 
be irradiated with a suitable dose of gamma irradiation, preferably, 1 Gy, 2 Gy, 3 Gy, 4 Gy, 

10 5 Gy, 6 Gy, 7 Gy, 8 Gy, 9 Gy, 10 Gy, 1 1 Gy, 12 Gy, 15 Gy or 20 Gy, without compromising 
the viability and/or mobility of the sperms. (See Wooster et al, 1977, Can. J. Genet. Cytol. 
19: 437-446). 

Whether employed in the in vivo, in situ or in vitro method, the gene delivery 
mixture, once in contact with the male germ cells, facilitates the uptake and transport of 
15 heterologous genetic material into the appropriate cell location for integration into the 
genome and expression. A number of known gene delivery methods can be used for the 
uptake of nucleic acid sequences into the cell and facilitate the integration of the 
heterologous nucleic acid into the genome of the recipient cell. Such methods include, but 
are not limited to viral vectors, liposomes, electroporation, REMI, and ICSI. 
20 A gene delivery mixture suitable for use in the in vivo, in situ or in vitro methods of 

sperm-mediated transfection comprises a nucleic acid encoding a desired trait or product, 
and a suitable promoter sequence such as, for example, a tissue-specific promoter, or an 
IRES. The transgenic nucleic acids of the present invention may further comprise an origin 
of replication. For example, an origin of replication may be the SV40 ori, or a centromere 
25 derived from the chicken. A linear nucleic acid may further comprise a telomere at one or 
both ends of the nucleic acid. 

Optionally, agents that increase the uptake of, or comprise non-eukaryotic viral 
vectors, e.g., plasmids, BACs, YACs, etc., the nucleic acid sequence, such as liposomes, 
retroviral vectors, adenoviral vectors, adenovirus enhanced gene delivery systems, or 
30 combmationsmereofmaybemcludedmmegenedeUveryniixture. A reporter construct, 
including a genetic selection marker, such as the gene encoding for Green Fluorescent 
Protein, may also be added to the gene delivery mixture. Targeting molecules, such as c-kit 
ligand, can be added to the gene delivery mixture to enhance the transfer of genetic material 
into the male germ cell. An immunosuppressing agent, such as cyclosporin or a 
35 corticosteroid may also be added to the gene delivery mixture as known in the art. 
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Any of a number of commercially available gene delivery mixtures can be used, to 
which the polynucleotide encoding a desire trait or product is further admixed. The final 
gene delivery mixture comprising the polynucleotide can then be admixed with the male 
gamete cells and allowed to interact for a period of between about 2 hours to about 16 
5 hours, at a temperature of about 33 °C to about 37°C. After this period, the cells are 

preferably placed at a lower temperature of about 33 °C to about 34°C, for about 4 hours to 
about 20 hours, preferably about 16 to about 18 hrs. 

Isolating and/or selecting genetically transgenic germ cells (and transgenic somatic 
cells, and of transgenic vertebrates) is by any suitable means, such as, but not limited to, 
10 physiological and/or morphological phenotypes of interest using any suitable means, such as 
biochemical, enzymatic, immunochemical, histologic, electrophysiologic, biometric or like 
methods, and analysis of cellular nucleic acids, for example the presence or absence of 
specific DNAs or RNAs of interest using conventional molecular biological techniques, 
including hybridization analysis, nucleic acid amplification including, but not limited to, 
1 5 polymerase chain reaction, transcription-mediated amplification, reverse transcriptase- 
mediated ligase chain reaction, and/or electrophoretic technologies. 

One preferred method of isolating or selecting male germ cell populations comprises 
ob tainin g specific male germ cell populations, such as spermatogonia, from a mixed 
population of testicular cells by extrusion of the cells from the seminiferous tubules and 
20 enzyme digestion. The spermatogonia, or other male germ cell populations, can be isolated 
from a mixed cell population by methods such as the utilization of a promoter sequence that 
is specifically or selectively active in cycling male germ line stem cell populations. Suitable 
promoters include B-Myb or a specific promoter, such as the c-kit promoter region, c-raf-1 
promoter, ATM (ataxia-telangiectasia) promoter, vasa promoter, RBM (ribosome binding 
25 motif) promoter, DAZ (deleted in azoospermia) promoter, XRCC-1 promoter, HSP 90 (heat 
shock gene) promoter, cyclin Al promoter, or FRMI (from Fragile X site) promoter and the 
like. A selected promoter may be linked to a reporter construct, for example, a construct 
comprising a gene encoding Green Fluorescent Protein (or EGFP), Yellow Fluorescent 
Protein, Blue Fluorescent Protein, a phycobiliprotein, such as phycoerythrin or phycocyanin, 
30 or any other protein which fluoresces under suitable wave-lengths of light, or encoding a 
light-emitting protein, such as luciferase or apoaequorin. The unique promoter sequences 
drive the expression of the reporter construct only during specific stages of male germ cell 
development (eg., Mailer etal, 1999, J. Biol. Chem. 276(16), 11220-28; Schrans-Stassen 
et al, 1999, Endocrinology 140, 5894-5900, both of which are incorporated herein by 
35 reference in their entireties). In the case of a fluorescent reporter construct, the cells can be 
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sorted with the aid of, for example, a FACS set at the appropriate wavelength(s), or they can 
be selected by chemical methods. 

Male germ cells that have the DNA modified in the desired manner are isolated or 
selected, and transferred to the testis of a suitable recipient avian, preferably the donor male 
5 avian of the male germ cells. Further selection can be attempted after biopsy of one or both 
of the recipient male's testes, or after examination of the animal's ejaculate amplified by the 
polymerase chain reaction to confirm that the desired nucleic acid sequence had been 
incorporated. 

The genetically modified germ cells isolated or selected as described above are 
10 transferred to a testis of a suitable male avian, preferably a chicken, that can be, but need not 
be, the same donor animal. Before transferring the genetically modified male germ cells to 
the recipient animal, the testes of the recipient are depopulated of endogenous germ cells, 
thereby facilitating the colonization of the recipient testis by the genetically modified germ 
cells. Depopulation of the testis has commonly been accomplished by exposing the whole 
15 animal to gamma irradiation or by localized irradiation of the testis. The basic rigid 

architecture of the gonad should not be destroyed, nor significantly damaged. Disruption of 
tubules may lead to impaired transport of testicular sperm and result in infertility. Sertoli 
cells should not be irreversibly damaged, as they provide a base for development of the 
germ cells during maturation, and for preventing the host immune defense system from 
20 destroying grafted foreign spermatogonia. 

Suitable denuding methods, include irradiation by gamma-rays, x-rays, ultrasound, 
ultraviolet light, by chemical treatment, by means of infectious agents such as viruses, or by 
autoimmune depletion or by combinations thereof, preferably by a combined treatment of 
me vertebrate with an alkylating agent and gamma irradiation as taught in WO 00/69257, 
25 incorporated herein by reference in its entirety. 

Gamma radiation-induced spermatogonial degeneration probably related to the 
process of apoptosis. (Hasegawaerai, 1998, Radiat. Res. 149:263-70). Alternatively, a 
composition containing an alkylating agent such as busulfan (MYLERAN™) can be used to 
depopulate. (Jiang F.X., 1998, Anat. Embryol. 198(1): 53-61; Russell and Brinster, 1996, J. 
30 Androl. 17(6): 615-27; Boujrad et al, 1995, Andrologia 27(4): 223-28; Linder et al, 1992, 
Reprod. Toxicol. 6(6): 491-505; Kasuga and Takahashi, 1986, Endocrinol Jpn 33( 1): 105- 
1 5). Other cytotoxic alkylating agent, may be, but is not limited to, chlorambucil, 
cyclophosphamide, melphalan, or ethyl ethanesulfonic acid, and may be combined with 
gamma irradiation, to be administered in either sequence. 

35 
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The dose of the alkylating agent and the dose of gamma radiation are in an amount 
sufficient to substantially depopulate the testis. The alkylating agent can be adrninistered by 
any pharmaceutically acceptable delivery system, including but not limited to, 
intraperitoneal, intravenous, or intramuscular injection, intravenous drip, implant, 
5 transdermal or transmucosal delivery systems. 

The isolated or selected genetically modified germ cells are transferred into the 
recipient testis by direct injection using a suitable micropipette. Support cells, such as 
Leydig or Sertoli cells, that can be unmodified or genetically modified, can be transferred to 
a recipient testis along with the modified germ cells. 

10 

5.1.1.3 DELIVERY OF TRANSGENIC SPERM TO OOCYTES 

The transfected male avian germ cells may be used to deliver a heterologous nucleic 
acid to an avian oocyte by implanting the transfected male germ cells such as transfected 
spermatogonia! precursor cells, into the testicular tissue of host male birds previously 

15 denuded of viable spermatogonia! cells or sperm. The implanted transfected male avian 
germ cells may colonize the testicular tissue, proliferate therein, and generate viable 
transgenic sperm that may be harvested for use in artificial insemination procedures, or 
transferred to a recipient oocyte by natural coitus. 

In certain embodiments, therefore, the transgenic avian may be produced by the 

20 sperm-mediated transfer of at least one heterologous transgene. The transgene may be 
incorporated into the genomic nucleic acid of a spermatozoon cell or a precursor thereof, so 
that a genetically modified avian sperm is produced by the male avian. Breeding the male 
avian with a female of its species will generate a transgenic progeny carrying the at least one 

transgene in its genome. 

25 A union of male and female gametes to form a transgenic zygote is brought about by 

copulation of the male and female vertebrates of the same species, or by in vitro or in vivo 
artificial means. If artificial means are chosen, then incorporating into the genome a genetic 
selection marker that is expressed in male germ cells is particularly useful. 

Suitable artificial means include, but are not limited to, artificial insemination, in 

30 vitro fertilization (TVF) and/or other artificial reproductive technologies, such as 

intracytoplasmic sperm injection (ICSI), subzonal insemination (SUZI), or partial zona 
dissection (PZD). Also others, such as cloning and embryo transfer, cloning and embryo 
splitting, and the like, can be employed. 

In a preferred embodiment, a transgene is incorporated into an avian sperm by 

35 intracytoplasmic sperm injection (ICSI). The male germ cells, which may be intact and 
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v« eS p^ozoa,o, to no„.v»leheads^maybe«eo.edm»te 

I^Lclcnpe^nn— 
> microinjection of an opaque avian egg. artificial 
The transgenic vertebrate progeny can, in turn, be bred by natural mating, artificial 

insemination, or by mv^^ , > 

Zl^l sucbasmtracytoplasnn^ 

• .tWCHICSI™) subzonal insemination (SUZI), or partial zona dissection 

inLgeLceUsoffa^progeoyand^entge— teeof. m add,«on,«be 
g^oma.e^^^obepr.seotinceUsofteprogenyoteAan germ cells,,.., 

fertiltzed oocyte to a surrogate mote, especially a female chicken, for the continued 
^^lopn^ttansgeniczygo*. ^cMctans.thedeve.opede^ 
^dasahaM^neggte^ha^aaaviableehictmen^he^^onctao 

20 ^.Igeniehe.elgonsnneleieacidinal.on^.ls. ^*>~**~°*" 

^ is e^mal wi* tespec. ,o me genome of me nnnsgenic zygote 
episomalnucleicacidenmprisesaennttomenotody.mostilfno.aJl.ofmeenUs fthe 

aSmosMc^expressionofmee.ogenous^gene^onlyccc.msome.bn.no.all 
cells or tissues of the transgenic animal. 

51 2 BREEDING AND MAINTENANCE OF TRANSGENIC AVIAN 

Another aspect of the present invention is a transgenic avian produced by the 
30 methods of the present invention and producing aheterologous polypeptide in an egg, 
wherein the transgenic avian comprises at least one heterologous nucleic acid sequence 
eX*epo^ 
of an avian egg by a female of the avian. 

The invention relates to a method of producing transgenic avians that express 
35 significant quantities of useful heterologous proteins, e.g., therapeutic and diagnostic 
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proteins, including immunoglobulins, industrially useful proteins and other biologies etc. in 
the avian egg white. The heterologous protein can then be readily purified from the avian 
egg. The methods of the invention provide improved efficiencies of transgenesis, 
transmission of the transgene and/or level of heterologous protein expression. Another 
5 aspect of the invention is a method of producing a transgenic avian capable of expressing a 
heterologous protein. Therefore, the present invention relates to methods of producing 
transgenic avians, preferably chickens, wherein the incorporated transgene may be 
expressed as a constituent protein of the white of a hard-shell egg. 

Although the genetic material is originally inserted solely into the germ cells of a 
10 parent animal, it will ultimately be present in the germ cells of future progeny and 

subsequent generations thereof. In addition, the genetic material will also be present in cells 
of the progeny other than germ cells, i.e., somatic cells. 

Using the methods of the invention for producing transgenic avians, particularly 
methods using vectors that are not derived from eukaryotic viruses, and, preferably, the 
1 5 methods of cytoplasmic micro-injection described herein, the level of mosaicism of the 
transgene (percentage of cells containing the transgene) in avians hatched from 
microinjected embryos (i.e., the G 0 s) is greater than 5%, 10%, 25%, 50%, 75% or 90%, or is 
the equivalent of one copy per one genome, two genomes, five genomes, seven genomes or 
eight genomes, as determined by any number of techniques known in the art and described 
20 infra. 

In additional particular embodiments, the percentage of GOs that transmit the 
transgene to progeny (Gls) is greater than 5%, preferably, greater than 10%, 20%, 30%, 
40%, and, most preferably, greater than 50%, 60%, 70%, 80%, 90%. In other embodiments, 
the transgene is detected in 10%, 20%, 30%, 40%, and most preferably, greater than 50%, 
25 60%, 70%, 80%, 90% of chicks hatching from embryos into which nucleic acids have been 
introduced using methods of the invention. 



5.2 VECTORS 

A variety of vectors useful in carrying out the methods of the present invention are 
30 described herein. These vectors may be used for stable introduction of a selected 

heterologous polypeptide-coding sequence (and/or regulatory sequences) into the genome of 
an avian, in particular, to generate transgenic avians that produce exogenous proteins in 
specific tissues of an avian, and in the oviduct in particular, or in the serum of an avian. In 
still further embodiments, the vectors are used in methods to produce avian eggs containing 
35 exogenous protein. 
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In particular embodiments, preferably for use in the sperm-mediated transgenesis 
methods described herein, the vectors of the invention are not derived from eukaryotic viral 
vectors or retroviral vectors (except in certain embodiments for containing eukaryotic viral 
regulatory elements such as promoters, origins of replication, etc). In particular 

5 embodiments, the vector is not an REV, ALV or MuLV vector. In particular, useful vectors 
include, bacteriophages such as lambda derivatives, such as Xgtl 1, Agt WES.tB, Charon 4, 
and plasmid vectors such as pBR322, pBR325, pACYC177, pACYC184, pUC8, pUC9, 
pUC18, pUC19, pLG339, P R290, pKC37, pKClOl, SV40, pBluescript® H SK +/- or KS 
+/- (see "Stratagene Cloning Systems" Catalog (1993) from STRATAGENE®, La Jolla, 

10 Calif., which is hereby incorporated by reference), pQE, pIH821, pGEX, pET series (see 
Studier, F.W. et. al., 1990, "Use of T7 RNA Polymerase to Direct Expression of Cloned 
Genes" Gene Expression Technology 185, which is hereby incorporated by reference) and 
any derivatives thereof, cosmid vectors and, in preferred embodiments, artificial 
chromosomes, such as, but not limited to, YACs, BACs, BBPACs or PACs. Such artificial 

15 chromosomes are useful in that a large nucleic acid insert can be propagated and introduced 
into the avian cell. 

In other particular embodiments, as detailed above in section 5.2, infra, the vectors 
of the invention are derived from eukaryotic viruses, preferably avian viruses, and can be 
replication competent or, preferably, replication deficient. In particular embodiments, the 

20 vectors are derived from REV, ALV or MuLV. Nucleic acid sequences or derivative or 
truncated variants thereof, may be introduced into viruses such as vaccinia virus. Methods 
for making a viral recombinant vector useful for expressing a protein under the control of 
the lysozyme promoter are analogous to the methods disclosed in U.S. Patent Nos. 
4,603,112; 4,769,330; 5,174,993; 5,505,941; 5,338,683; 5,494,807; 4,722,848; Paoletti, E, 

25 1996, Proc. Natl. Acad. Sci. 93: 11349-11353; Moss, 1996, Proc. Natl. Acad Sci. 93: 
11341-11348; Roizman, 1996, Proc. Natl. Acad Sci. 93: 11307-11302; Frolov etal., 1996, 
Proc. Natl. Acad Sci. 93: 11371-11377; Grunhaus etal., 1993, Seminars in Virology 3: 
237-252 and U.S. Patent Nos. 5,591,639; 5,589,466; and 5,580,859 relating to DNA 
expression vectors, inter alia; the contents of which are incorporated herein by reference in 

30 their entireties. 

Recombinant viruses can also be generated by transfection of plasmids into cells 
infected with virus. 

Preferably, vectors can replicate (i.e., have a bacterial origin of replication) and be 
manipulated in bacteria (or yeast) and can then be introduced into avian cells. Preferably, 
35 the vector comprises a marker that is selectable and/or detectable in bacteria or yeast cells 
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and, preferably, also in avian cells, such markers include, but are not limited to, Amp r , tef , 
LacZ, etc. Preferably, such vectors can accommodate (i. e. , can be used to introduce into 
cells and replicate) large pieces of DNA such as genomic sequences, for example, large 
pieces of DNA consisting of at least 25 kb, 50 kb, 75 kb, 100 kb, 150 kb, 200 kb or 250 kb, 

5 such as BACs, YACs, cosmids, etc. 

The insertion of a DNA fragment into a vector can, for example, be accomplished by 
ligating the DNA fragment into a vector that has complementary cohesive termini. 
However, if the complementary restriction sites used to fragment the DNA are not present 
in the vector, the ends of the DNA molecules may be enzymatically modified. 

10 Alternatively, any site desired may be produced by ligating nucleotide sequences flinkers) 
onto the DNA termini; these ligated linkers may comprise specific chemically synthesized 
oligonucleotides encoding restriction endonuclease recognition sequences. In an alternative 
method, the cleaved vector and the transgene may be modified by homopolymeric tailing. 
The vector can be cloned using methods known in the art, e.g.,by the methods 

15 disclosed in Sambrook et aL, (supra); Ausubel et al, 1989, Current Protocols in Molecular 
Biology, Green Publishing Associates and Wiley Interscience, N. Y., both of which are 
hereby incorporated by reference in their entireties. Preferably, the vectors contain cloning 
sites, for example, restriction enzyme sites that are unique in the sequence of the vector and 
insertion of a sequence at that site would not disrupt an essential vector function, such as 

20 replication. 

As discussed above, vectors used in certain methods of the invention preferably can 
accommodate, and in certain embodiments comprise, large pieces of heterologous DNA 
such as genomic sequences, particularly avian genomic sequences. Such vectors can 
contain an entire genomic locus, or at least sufficient sequence to confer endogenous 

25 regulatory expression pattern, e.g., high level of expression in the magnum characteristic of 
ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and ovomucin, etc, and to 
insulate the expression of the transgene sequences from the effect of regulatory sequences 
surrounding the site of integration of the transgene in the genome. Accordingly, as detailed 
below, in preferred embodiments, the transgene is inserted in an entire genomic loci or 

30 significant portion thereof. 

To manipulate large genomic sequences contained in, for example, a BAC, 
nucleotide sequences coding for the heterologous protein to be expressed and/or other 
regulatory elements may be inserted into the BAC by directed homologous recombination in 
bacteria, e.g., the methods of Heintz WO 98/59060; Heintz et al 9 WO 01/05962; Yang et 
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al ml, Nature BiCecHnol 1 5: 859-865; Yang e, al, 1999, Nature Genetics 22: 327-35; 

AKemadvely, to BAC can also be engineered or modrfied by E-T clomng as 
describedbyMuymrsern/. (,999, MrcWc^fc to. 27(6): 1555-57, hrcorporatedhere.n 
1 byref er^mi<senth«y). U S ing t he« m e*ods,spe, ffl eDNAmayl«eng m eered 1 n^a 

1998,*. 0~t 20(2): 123-28; incorporated herein byefaencm* a***). 
Homologonarecombnrndoncanbepe^b^eenaPCRWfl^by^ 

10 ^ogyannaandanendogenonstototrecipienrsucbasaBAC. Using uusmedrod, 

homologous recombtoarion is not limited by die disposition of restriction endonuclease 
LJ^ortosizeofto^DNA.ABACcnnb.modmedinitsbos.s.ramoanrg 

fLonalcomrterparts of phage lambda (Mtryrers a,./., 1999, ***** *«■ » 
15 ,555-57). F,eforab.y,aBACisnu^ rf byrecombh.don™.haPCRprodno,oontomrg 
homology arms ranging from 27-60 bp. In a specific embodiment homology arms are 50 

" ta Mother embodiment a transgene is inserted into a yeaat artificial chromosome 
(YAQ (Burke a, at, 1987, ScU.ce 236: 806-12; and Peterson e, aL, 1997, JVa^Js Gene,. 
20 B-ei.bomofwmchammcoreoratedbymferancehemininmeirenhretres). 

' ta other embodiments, the transgene is inserted into another vector developed for the 
cloning of large segments of genomic DNA, snch as a coamid or bacteriophage PI 
(Sternberg at* 1990, Proc. HA Acad. Set. USAiT. 103-07). Theappmxrmate 
maximum insert size is 30-35 kb for cosmids and 100 kb for bacteriophage PI. ^another 
25 embodiment, the transgene is inserted into a P-l derived artificial chromoaoma ffAC) 
(Mejiaetal., 1997, Genome Res 7:179-186). The maximnm insert aiae .300 kb_ 

Vectors containing me appropriate heterologous sciences may be rdenufiedby any 
mettrod well knotvn in the art, for example, by sequencing, restriction mappmg, 
hvbridizarion, PCR amplification, etc. 
30 The vectors of the invention comprise one or more nucleotide sequences encoding a 

heterologous protein desired to be expressed in the transgenic avian, as well as regulatory 
elements such as promoters, enhancers, MARs, IRES's and other translation control 
elements, transcriptional termination elements, polyadenylation sequences, etc, as dxscussed 
infra In particular embodiments, the vector of the invention contains at least two 
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heavy and light chains of an immunoglobulin. 

toa pmfemedembodimen<,thenucleotide^^^ . 
protein is inserted into aU or a significant portion of a nucleic acid containing dte genonnc 

_ e.g., ovalbumin, lysozyme, ovomncoid, ovotransferrin, conalbunnn and 

In science of the endogenous gene genomic ^uence. Preferably, me M» 
10 genecIrgsequencehasitsownlRES. For descriptions of MS, see, ^ ac^e, 
1 1 990,^i to c to Sd.l5( 1 2):477-83 ; Jang«nl,1988,J.F,o ( . 62(8^2636-43, 

jl„nl 1990,^ 44(1-4)292-309; andMartinez-Satos, 1999, Curr.Opn 

incorporated by reference herein in their entireties. In another embodiment, tire 
IShe^.ogonsnm.einceKhngsee.nenceUinaertcnatmeyendofmeendogenon.gene 

coding sequence. In another preferred embodiment, the heterologous gene ccdrng 
sequences are inserted using 5' direct fusion wherein the heterologous gene , codmg 
sequences are inserted in-frame adjacent to the initial ATO sequence (or adjacent the 
nlotide sequence encoding the first two, force, four, free, six, seven or ergh, ammo acds) 
20 ofthe endogenous gene or replacing some or all of fee sequenceof ure endogenous gene 
coding sequence. In ye. another specific embodiment, the heterologous gene «*"■ 

sequenceandhasanindepeudentlEESsequence. 

The present invention further relates to nucleic acid vectors (preferably, not derrved 
25 from euharyotic virt^ excepti » certidn embodiments, for euharyofic viral promoters and/ 
or enhancers) and tranagenes inserted therein drat incorporate multiple !^ff^ 
encoding regions, wherein a firs, polypeptide-encoding region is operadvely lurked toa 
transcription promoter and a second polyrepnoe-encoding region is operahvely Imked to an 
IMS.Forexample.fhevecmrmaycontamcoduigsequencesforrwou.fferent 

sequences for all or a significant part of the genomic science for the gene from whrch the 
promoter driving expression ofthe transgeue is derived, and the heterologous protetn 
desired to be expressed (e.g„ a construct containing the genomic coding sequences, 
including introns, ofthe avian lysozyme gene when the avian lysozyme promoter rs used to 
35 drive expression ofthe transgeue, an IRES, and the coding sequence for the heterologous 
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protein desired to be expressed downstream (i.e., 3' on the RNA transcript of the IRES). 
Thus, in certain embodiments, the nucleic acid encoding the heterologous protein is 
introduced into the 5' untranslated or 3' untranslated regions of an endogenous gene, such as 
but not limited to, ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and 
5 ovomucin, with an IRES sequence directing translation of the heterologous sequence. 

Such nucleic acid constructs, when inserted into the genome of a bird and expressed 
therein, will generate individual polypeptides that may be post-translationally modified, for 
example, glycosylated or, in certain embodiments, form complexes, such as heterodimers 
with each other in the white of the avian egg. Alternatively, the expressed polypeptides may 
10 be isolated from an avian egg and combined in vitro, or expressed in a non-reproductive 
tissue such as serum. In other embodiments, for example, but not limited to, when 
expression of both heavy and light chains of an antibody is desired, two separate constructs, 
each containing a coding sequence for one of the heterologous proteins operably linked to a 
promoter (either the same or different promoters), are introduced by microinjection into 
1 5 cytoplasm of one or more embryonic cells and transgenic avians harboring both transgenes 
in their genomes and expressing both heterologous proteins are identified. Alternatively, 
two transgenic avians each containing one of the two heterologous proteins (e.g. , one 
transgenic avian having a transgene encoding the light chain of an antibody and a second 
transgenic avian having a transgene encoding the heavy chain of the antibody) can be bred 
20 to obtain an avian containing both transgenes in its germline and expressing both transgene 
encoded proteins, preferably in eggs. 

Recombinant expression vectors can be designed for the expression of the encoded 
proteins in eukaryotic cells. Useful vectors may comprise constitutive or inducible 
promoters to direct expression of either fusion or non-fusion proteins. With fusion vectors, 
25 a number of amino acids are usually added to the expressed target gene sequence such as, 
but not limited to, a protein sequence for thioredoxin, a polyhistidine, or any other amino 
acid sequence that facilitates purification of the expressed protein A proteolytic cleavage 
site may further be introduced at a site between the target recombinant protein and the 
fusion sequence. Additionally, a region of amino acids such as a polymeric histidine region 
30 may be introduced to allow binding of the fusion protein to metallic ions such as nickel 
bonded to a solid support, and thereby allow purification of the fusion protein. Once the 
fusion protein has been purified, the cleavage site allows the target recombinant protein to 
be separated from the fusion sequence. Enzymes suitable for use in cleaving the proteolytic 
cleavage site include, but are not limited to, Factor Xa and thrombin. Fusion expression 
35 vectors that may be useful in the present invention include pGex (AMRAD® Corp., 
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Melbourne, Australia), pWT5 (PHARMACIA®, Piscaaway, NJ) and pMAL (NEW 
ENGLAND BIOLABS®, Beverly, MA), fetog glutathione S-transferase, protein A, or 
maltose E binding protein, respectively, to ore target recombinant proton, 

Once a promoter and a nncleic acid encoding a heterologous protem of tire present 

cell Snch incorporation can be carried out by me various forms of transformation noted 

of me DNA of the present invention into a recipient cell may be by any suitable method 
such as, M no. limited to, viral tiansfer, electeoporation, gene gun insertion spenn- 
.0 ^atodtirmafertoanovam.micromiectionandti.elike. Smteblehos. cells mctade^. 

m „„, hmited to, bacteria, vinrs, yeas, mammdian ceUs, and the hke. to part.cu.ar, me 
^.mvcntioncontomplatosnrenseofrecipiemaviancells.snchascmclrencensor 

quail cells. „ „ 

Another aspect of me present invention, therefore, is a meted of expressing a 

^mbinan.DNAcomprismganavi.mtissne-specmcpmmoteroperab.y.ndcedtoanuoteto 
acid insert encoding a polypeptide and, optionaUy, a polyadenylation sqmal sequence, and 
entering the tiansfected cell in a medium suitable for expression of tire heterologous 
polypeptide under me oontiol of me avian lysozyme gene exposition contro mgtom 
20 Ye.anomeraspeo.oflhepresen.inventionlsaentayotieoeUc^sformedw.man 

nucleic acid insert comprises tire chicken lysozyme gene expression contio region, a 
nncleic acid ^encoding a human in«rferona2b and codonoptirmzed for cxpressronm 

25 an avian cell, and an SV40 polyadenylation sequence. 

banoti.eremhodunen.meti^fonnedcellisaquaUovidnc.cellandthen^c 

acid insertcompriseatirc artificial aviarpromoter co-storctMDOT (SEQIDNO.ill) 
operably linked to an Wcrferon-encoding sequence, as described in Example 23 below. 
' In yet another embodiment of the present invention, a quail oviduct cell is 

operably linked to an erytoopoietin (EPO>encoding nucleic acid, whemrn the tiansfected 
quail produces heterologous erythropoietin. 
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5.2.1 PROMOTERS 

The vectors of the invention contain promoters that function in avian cells, 
preferably, that are tissue-specific and, in preferred embodiments, direct expression in the 
magnum or serum or other tissue such that expressed proteins are deposited in eggs, more 
5 preferably, that are specific for expression in the magnum. Alternatively, the promoter 
directs expression of the protein in the serum of the transgenic avian. Introduction of the 
vectors of the invention, preferably, generate transgenics that express the heterologous 
protein in tubular gland cells where it is secreted into the oviduct lumen and deposited, e.g., 
into the white of an egg. In preferred embodiments, the promoter directs a level of 

10 expression of the heterologous protein in the egg white of eggs laid by GO and/or Gl chicks 
and/or their progeny that is greater than 5 ^g, 10 jig, 50 ng, 100 jig, 250 |xg 9 500 ^g or 750 
Hg, more preferably greater than 1 mg, 2 mg, 5 mg, 10 mg, 20 mg, 50 mg, 100 mg, 200 mg, 
500 mg, 700 mg, 1 gram, 2 grams, 3 grams, 4 grams or 5 grams. Such levels of expression 
can be obtained using the promoters of the invention. 

1 5 In preferred embodiments, the promoters of the invention are derived from genes 

that express proteins present in significant levels in the egg white and/or the serum. For 
example, the promoter comprises regions of an ovalbumin, lysozyme, ovomucoid, 
ovotransferrin, conalbumin or ovomucin promoter or any other promoter that directs 
expression of a gene in an avian, particularly in a specific tissue of interest, such as the 

20 magnum or in the serum. Alternative^ the promoter used in the expression vector may be 
derived from that of the lysozyme gene that is expressed in both the oviduct and 
macrophages. Portions of two or more of these, and other promoters that function in avians, 
may be combined to produce effective synthetic promoter. 

The promoter may optionally be a segment of the ovalbumin promoter region that is 

25 sufficiently large to direct expression of the coding sequence in the tubular gland cells. 
Other exemplary promoters include the promoter regions of the ovalbumin, lysozyme, 
ovomucoid, conalbumin, ovotransferrin or ovomucin genes (for example, but not limited to, 
as disclosed in co-pending United States Patent Application Nos. 09/922,549, filed August 
3, 2001 and 10/1 14,739, filed April 1, 2002, both entitled "Avian Lysozyme Promoter", by 

30 Rapp, and United States Patent Application No. 09/998,716, filed November 30, 2001, 
entitled "Ovomucoid Promoter and Methods of Use," by Harvey et al, all of which are 
incorporated by reference herein in their entireties). Alternatively, the promoter may be a 
promoter that is largely, but not entirely, specific to the magnum, such as the lysozyme 
promoter. Other suitable promoters may be artificial constructs such as a combination of 

35 nucleic acid regions derived from at least two avian gene promoters. One such embodiment 
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derived ton. the chicken ovomucin and ovotransfemn promoters, tacludtag 
tSL alters I— — - - «— * — ** ** 

the tubular gland cells of the magnum of the oviduct e3 *°' ^'^^ percent ofthe total 
Ovalbummiatte most ahnndantegg^pro^compnamg over 50 p^of^ 

Lin produced by the tabular gland cells, or about 4 grams of proteta per large Gmde A 

ST^»— geneandoverZOkbofeach^g-egionbavebeencon^d 

, Jr, 1978 mcNMJniSd. MM 75:2205-2209; Gannon era!., 1979, 

analyzed (Lar e< » ,1978, Pro t «. ^ ^ 

Nature 278:428-424; Roop e«rf.,1980, Leu ly.oJ o , 
a^progesterone^chinducetheaccumuladonofabo^ 

transcripts per tubular gland cell in the mature laymg hen (Palmiter, 1973, J. Art Ctot 
transcnpts per iudui g contams four 

lAcsofin 8270- Palmiter, 1975, Ce/74:189-iy/;. inc3u^6 
248.8260-8270, raimi , 2 ^.g.ofcb fiom the transcription 

20 DNAsel-hypersensitive sites centered at -0.25, -0.8, ^,ana 

DNAseinyp respectively. Promoters of the 

start site. These sites are calledHS-I, II ffl,an I . P* dHSOIV . 
invention may contain one, all, or a combination of HS-I, HS n, HS 
Hypersensitivity of HS-H and -HI are estrogen-induced, supportmg a role for these regions 
in hormone-induction of ovalbumin gene expression. 

30 XL.preLontataeabs.ceofhormonetHaekers^., "«.™^ 3 - 
nuclei suggesting a role in tissue-specific expression. HS-11 is termea 

JUuon. Itbtadsaprotataorp.otamcnmp.ex.mov.asaurp-l. 
35 es^andtu^overrupid.ytathepxes^ofey.lohexamrdeCDeaner./., 1996,Mol. 
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Cell. Biol. 16:2015-2024). Experiments using an explanted tubular gland cell culture 
system defined an additional set of factors that bind SDRE in a steroid-dependent manner, 
including a NFKB-like factor (Nordstrom era/., 1993, J Biol. Chem. 268:13193-13202; 
Schweers and Sanders, 1991, J. Biol. Chem. 266: 10490-10497). 

5 Less is known about the function of HS-IH and HS-IV. HS-HI contains a functional 

estrogen response element, and confers estrogen inducibility to either the ovalbumin 
proximal promoter or a heterologous promoter when co-transfected into HeLa cells with an 
estrogen receptor cDNA. These data imply that HS-HI may play a functional role in the 
overall regulation of the ovalbumin gene. Little is known about the function of HS-IV, 

1 0 except that it does not contain a functional estrogen-response element (Kato et al. , 1992, 
Cell 68: 731-742). 

In an alternative embodiment of the invention, transgenes containing constitutive 
promoters are used, but the transgenes are engineered so that expression of the transgene 
effectively becomes magnum-specific. Thus, a method for producing an exogenous protein 

1 5 in an avian oviduct provided by the present invention involves generating a transgenic avian 
having two transgenes in its tubular gland cells. One transgene comprises a first coding 
sequence operably linked to a constitutive promoter. The second transgene comprises a 
second coding sequence that is operably linked to a magnum-specific promoter, where 
expression of the first coding sequence is either directly or indirectly dependent upon the 

20 cellular presence of the protein expressed by the second coding sequence. 

Additional promoters useful in the present invention include inducible promoters, 
such as the tet operator and the metallothionein promoter which can be induced by 
treatment with tetracycline and zinc ions, respectively (Gossen et al , 1992, Proc. Natl. 
Acad. Sci. 89: 5547-5551 and Walden etal, 1987, Gene 61: 317-327; incorporated herein 

25 by reference in their entireties). 

5.2.1.1 CHICKEN LYSOZYME GENE EXPRESSION CONTROL 

REGION NUCLEIC ACID SEQUENCES 

The chicken lysozyme gene is highly expressed in the myeloid lineage of 
30 hematopoietic cells, and in the tubular glands of the mature hen oviduct (Hauser et al. , 
1981, Hematol. and Blood Transfusion 26: 175-178; Schutz et al., 1978, Cold Spring 
Harbor Symp. Quart. Biol. 42: 617-624) and is therefore a suitable candidate for an efficient 
promoter for heterologous protein production in transgenic animals. The regulatory region 
of the lysozyme locus extends over at least 12 kb of DNA 5' upstream of the transcription 
35 start site, and comprises a number of elements that have been individually isolated and 
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,0 522 MATRIX ATTACHMENT REGIONS 

b prcfeted embodiments of the invention, the vectors contain 
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Phi-Van, L. and Startling, W.H., 1996, Biochem. 35: 10735-10742). Deletion of a 1.32 kb 
or a 1 .45 kb halves region, each comprising half of a 5' MAR, reduces positional variation 
in the level of transgene expression (Phi- Van and Stratling, supra). 

The 5' matrix-associated region (5' MAR), located about -1 1 .7 kb upstream of the 
5 chicken lysozyme transcription start site, can increase the level of gene expression by 
limiting the positional effects exerted against a transgene (Phi-Van et al, 1988, supra). At 
least one other MAR is located 3' downstream of the protein encoding region. Although 
MAR nucleic acid sequences are conserved, little cross-hybridization is seen, indicating 
significant overall sequence variation. However, MARs of different species can interact 
10 with the nucleomatrices of heterologous species, to the extent that the chicken lysozyme 
MAR can associate with the plant tobacco nucleomatrix as well as that of the chicken 
oviduct cells (Mlynarona et al, 1994, Cell 6: 417-426; von Kries et al, 1990, Nucleic Acids 
Res. 18: 3881-3885). 

Gene expression must be considered not only from the perspective of cis-regulatory 
1 5 elements associated with a gene, and their interactions with trans-acting elements, but also 
with regard to the genetic environment in which they are located. Chromosomal positioning 
effects (CPEs), therefore, are the variations in levels of transgene expression associated with 
different locations of the transgene within the recipient genome. An important factor 
governing CPE upon the level of transgene expression is the chromatin structure around a 
20 transgene, and how it cooperates with the cis-regulatory elements. The cis-elements of the 
lysozyme locus are confined within a single chromatin domain (Bonifer et al, 1996, supra; 
Sippel et al, pgs. 133-147 in Eckstein F. & Lilley D.MJ. (eds), "Nucleic Acids and 
Molecular Biology 55 , Vol. 3, 1989, Springer. 

The lysozyme promoter region of chicken is active when transfected into mouse 
25 fibroblast cells and linked to a reporter gene such as the bacterial chloramphenicol 
acetyltransferase (CAT) gene. The promoter element is also effective when transiently 
transfected into chicken promacrophage cells. In each case, however, the presence of a 5 ! 
MAR element increased positional independency of the level of transcription (Stief et al , 
1989, Nature 341 : 343-345; Sippel et al, pgs. 257 - 265 in Houdebine L.M. (ed), 
30 "Transgenic Animals: Generation and Use"). 

The ability to direct the insertion of a transgene into a site in the genome of an 
animal where the positional effect is limited offers predictability of results during the 
development of a desired transgenic animal, and increased yields of the expressed product. 
Sippel and Steif disclose, in U.S. Patent No. 5,73 1,178, which is incorporated by reference 
35 herein in its entirety, methods to increase the expression of genes introduced into eukaryotic 
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5 23 CODON-OPTIM1ZED GENE EXPRESSION 
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recipient cell for expression therein, the sequence of the nuclerc acrd sequence may be 

20 exatnple, if tire ^logons nucleic acid is tinted into a eecip en. chrckenc* 
sequleoffce expressed nucleic acid insert is optimized fcr cmcken codo. o*g, Ttas 

L nucleic acid sequences encoding flae proteins ovalbumin, lysozyme ovomuco rd, 
25 Irrrmsfertm.conalbumm.artdovomucmofcmcxen Brieve DNA 

Ml protein ma, be optimized using the BACKTRANSLATE® program of the Wrsconsm 
TacCc version 9, (Genetics Computer Oroup, me, Mauison, WI) wrih a codonusage 
table compiled from me chicken (OA. g« ovalbumin, lysozyme, ovomucord, 
ovotraosferrin, conalbumin, and ovomucin proteins. The template and prrrner 
30 oligonucleotides are then amplified, by any means known in the art, mcludmg but no, 
limitedmPCRwimr>tx.lvmernae(STl!ATAGENE®,UJollaCA). 

t one exemplary embodiment of a heterolo^us nucleic acid for use by me methods 
of the present invention, a nucleic acid msert encoding tire human interferon o2b 
peptide optimized for codon-nsage by the chicken is microinjected mm me cytoplasm 
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of a stage 1 embryo. Optimization of the sequence for codon usage is useful in elevating the 
level of translation in avian eggs. 

It is contemplated to be within the scope of the present invention for any nucleic 
acid encoding a polypeptide to be optimized for expression in avian cells. It is further 
5 contemplated that the codon usage may be optimized for a particular avian species used as a 
source of the host cells. In one embodiment of the present invention, the heterologous 
polypeptide is encoded using the codon-usage of a chicken. 

5.2.4 SPECIFIC VECTORS OF THE INVENTION 

10 In a preferred embodiment, a transgene of the invention comprises a chicken, or 

other avian, lysozyme control region sequence which directs expression of the coding 
sequence within the transgene. A series of PCR amplifications of template chicken 
genomic DNA are used to isolate the gene expression control region of the chicken 
lysozyme locus. Two amplification reactions used the PCR primer sets 5pLMAR2 (5'- 

15 TGCCGCCTTCTTTGATATTC-3 ') (SEQ ED NO: 1) and LE-6.1kbrevl (5'- 
TTGGTGGTAAGGCCTTTTTG-3') (SEQ ID NO: 2) (Set 1) and lys-6.1 (5'- 
CTGGC AAGCTGTC AAAAAC A-3 ') (SEQ ID NO: 3) and LysElRev (5 f - 
CAGCTCACATCGTCCAAAGA-3 ! ) (SEQ ID NO: 4) (Set 2). The amplified PCR 
products were united as a contiguous isolated nucleic acid by a third PCR amplification step 

20 with the primers SEQ ID NOS: 1 and 4. 

The isolated PCR-amplified product, comprising about 12 kb of the nucleic acid 
region 5' upstream of the native chicken lysozyme gene locus, was cloned into the plasmid 
pCMV-LysSPIFNMM. pCMV-LysSPIFNMM comprises a modified nucleic acid insert 
encoding a human interferon a2b sequence and an SV40 polyadenylation signal sequence 

25 (SEQ ED NO: 8) 3 ' downstream of the interferon encoding nucleic acid. The sequence SEQ 
ID NO: 5 of the nucleic acid insert encoding human interferon a2b was in accordance with 
avian cell codon usage, as determined from the nucleotide sequences encoding chicken 
ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and ovomucin. 

The nucleic acid sequence (SEQ ID NO: 6) (GenBank Accession No. AF405538) of 

30 the insert in pAVIJCR-Al 15.93.1.2 is shown in Figures 1A-E. The modified human 
interferon a2b encoding nucleotide sequence SEQ ID NO: 5 (GenBank Accession No. 
AF405539) and the novel chicken lysozyme gene expression control region SEQ ID NO: 7 
(GenBank Accession No. AF405540), shown in Figures 2 and 3A-E, respectively. A 
polyadenylation signal sequence that is suitable for operably linking to the polypeptide- 
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SIGMA GENOSYS®, The Woodlands, TX or The Great Amencan Gene Co., Ramona, 
CA). 

525 RECOMBINANT EXPRESSION VECTORS 

A useful application of the novel promoters of the present invention, sneh as the 
avianlyso.ymegeneexpressioneor^lregionCSEQBNO.^ornaeMDOTpromoter 

construe, (SEQ ID NO: 1 1) is the possibility of increasing the amount of a heterologous 
10 protetopresen.mabird.espeeiaUyachieken.bygenettansfer. In moat .instences, a 
hetero.ogouspolvpepnde.neodingnue.eieaeidmaertnunsfer^ 
hoat will be operably linked with a gene expression control region to aUow the eeU « 
initiate and continue prodttetionofthe genetic product protein. ArecombtttantDNA 
molecule of fire present invention can be tiansferred into me extra-ohromoaomal or genonne 

15 DNA of the host 

Expression of a foreign gene in an avian cell permits partial or complete post- 
ttanslational modification such as, but not only, glycosylating and/or the formation of the 

ft. chicken OA. go/ta include pYepSecl (Baldari e, a,., 1987, KUB.OJ '6: 229-234, 

» ^«^,^w^)-.™' H,^M<,B • 0,, ''" 

D "*°' U« present invention contemplates that the injected cell may transiently contain the 
injeoted DNA, whereby the recombinant DNA or expression vector may not he mtegrated 
mto me genomic nucleic acid. It is tome, contempt ma, the injected recmnbtnan, DNA 

25 or expression vector may be stably integrated into the genomic DNA of the recipten. cell, 
thereby replicating with the cell so tat each daughter cell receives a copy of the tnjected 
nucleic acid. It is still further contemplated for the scope of the present invention to tnclude 
a transgenic animal producing a heterologous protein expressed fern an injected nuclerc 
acid according to the present invention. 

30 Heterologous nucleic acid molecules can be delivered to oocytes using the sperm- 

mediated transfeetion methods of the present invention. The nucleic acid molecule may be 
inserted into a cell to which the nucleic acid molecule (or promoter coding region) is 
heterologous (/.*., not normally present). Alternatively, the recombinant DNA molecule 
may be introduced into cells wMch normaUy contam me recombinant DNA molecule or me 
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particular coding region, as, for example, to correct a deficiency in the expression of a 
polypeptide, or where over-expression of the polypeptide is desired. 

Another aspect of the present invention, therefore, is a method of expressing a 
heterologous polypeptide in an avian cell hy transfecting the avian cell with a selected 

5 heterologous nucleic acid comprising an avian promoter operably linked to a nucleic acid 
insert encoding a polypeptide and, optionally, a polyadenylation signal sequence. The 
transfected cell, which may be an avian embryonic cell microinjected with a heterologous 
nucleic acid, will generate a transgenic embryo that after introduction into a recipient hen 
will be laid as a hard-shell egg and develop into a transgenic chick. 

1 0 In another embodiment of the present invention, the nucleic acid insert comprises 

the chicken lysozyme gene expression control region, a nucleic acid insert encoding a 
human interferon <x2b and codon optimized for expression in an avian cell, and a chicken 3' 
domain, i.e. s downstream enhancer elements. 

In one embodiment of the present invention, the transgenic animal is an avian 

1 5 selected from a turkey, duck, goose, quail, pheasant, ratite, and ornamental bird or a feral 
bird. In another embodiment, the avian is a chicken and the heterologous polypeptide 
produced under the transcriptional control of the avian promoter is produced in the white of 
an egg. In yet another embodiment of the present invention, the heterologous polypeptide is 
produced in the serum of a bird. 

20 

53 HETEROLOGOUS PROTEINS PRODUCED BY TRANSGENIC 
AVIANS 

Methods of the present invention, providing for the production of heterologous 
protein in the avian oviduct (or other tissue leading to deposition of the protein into the egg) 

25 and the production of eggs containing heterologous protein, involve providing a suitable 
vector coding for the heterologous protein and introducing the vector into oocytes by sperm- 
mediated transfection such that the vector is integrated into the genome of the resulting 
transgenic embryo. A subsequent step involves deriving a mature transgenic avian from the 
transgenic embryo produced in the previous steps by transferring the injected cell or cells 

30 into the infundibulum of a recipient hen; producing a hard shell egg from that hen; and 
allowing the egg to develop and hatch to produce a transgenic bird. 

A transgenic avian so produced from transgenic embryonic cells is known as a 
founder. Such founders may be mosaic for the transgene (in certain embodiments, the 
founder has 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 90%, 100% of the cells containing 

35 the. transgene. The invention further provides production of heterologous proteins in other 
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5.3.1 MULTIMERIC PROTEINS 

The invention, in preferred embodiments, provides methods for producing 
multimeric proteins, preferably immunoglobulins, such as antibodies, and antigen binding 
fragments thereof. 

5 In one embodiment of the present invention, the multimeric protein is an 

immunoglobulin, wherein the first and second heterologous polypeptides are an 
immunoglobulin heavy and light chains respectively. Illustrative examples of this and other 
aspects and embodiments of the present invention for the production of heterologous 
multimeric polypeptides in avian cells are fully disclosed in U.S. Patent Application No. 
10 09/877,374, filed June 8, 2001, by Rapp, which is incorporated herein by reference in its 
entirety. In one embodiment of the present invention, therefore, the multimeric protein is an 
immunoglobulin wherein the first and second heterologous polypeptides are an 
immunoglobulin heavy and light chain respectively. Accordingly, the invention provides 
immunoglobulin and other multimeric proteins that have been produced by transgenic 
15 avians of the invention. 

In the various embodiments of this aspect of the present invention, an 
immunoglobulin polypeptide encoded by the transcriptional unit of at least one expression 
vector may be an immunoglobulin heavy chain polypeptide comprising a variable region or 
a variant thereof, and may further comprise a D region, a J region, a C region, or a 
20 combination thereof. An immunoglobulin polypeptide encoded by the transcriptional unit 
of an expression vector may also be an immunoglobulin light chain polypeptide comprising 
a variable region or a variant thereof, and may further comprise a J region and a C region. It 
is also contemplated to be within the scope of the present invention for the immunoglobulin 
regions to be derived from the same animal species, or a mixture of species including, but 
25 not only, human, mouse, rat, rabbit and chicken. In preferred embodiments, the antibodies 
are human or humanized. 

In other embodiments of the present invention, the immunoglobulin polypeptide 
encoded by the transcriptional unit of at least one expression vector comprises an 
immunoglobulin heavy chain variable region, an immunoglobulin light chain variable 
30 region, and a linker peptide thereby forming a single-chain antibody capable of selectively 
binding an antigen. 

Another aspect of the present invention provides a method for the production in an 
avian of an heterologous protein capable of forming an antibody suitable for selectively 
binding an antigen comprising the step of producing a transgenic avian incorporating at 
35 least one transgene, wherein the transgene encodes at least one heterologous polypeptide 
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selected from an immunoglobulin heavy chain variable region, an immunoglobulin heavy 
chain comprising a variable region and a constant region, an immunoglobulin light chain 
variable region, an immunoglobulin light chain comprising a variable region and a constant 
region, and a single-chain antibody comprising two peptide-linked immunoglobulin variable 

5 regions. Preferably, the antibody is expressed such that it is deposited in the white of the 
developing eggs of the avian. The hard shell avian eggs thus produced can be harvested and 
the heterologous polypeptide capable of forming or which formed an antibody can be 
isolated from the harvested egg. It is also understood that the heterologous polypeptides 
may also be expressed under the transcriptional control of promoters that allow for release 

10 of the polypeptides into the serum of the transgenic animal. Exemplary promoters for non- 
tissue specific production of a heterologous protein are the CMV promoter and the RSV 
promoter. 

In one embodiment of this method of the present invention, the transgene comprises 
a transcription unit encoding a first and a second immunoglobulin polypeptide operatively 

1 5 linked to a transcription promoter, a transcription terminator and, optionally, an internal 
ribosome entry site (IRES) (see, for example, U.S. Patent No. 4,937,190 to Palmenberg et 
al, the contents of which is incorporated herein by reference in its entirety). 

In an embodiment of this method of the present invention, the isolated heterologous 
protein is an antibody capable of selectively binding to an antigen. In this embodiment, the 

20 antibody may be generated within the serum of an avian or within the white of the avian egg 
by combining at least one immunoglobulin heavy chain variable region and at least one 
immunoglobulin light chain variable region, preferably cross-linked by at least one di- 
sulfide bridge. The combination of the two variable regions will generate a binding site 
capable of binding an antigen using methods for antibody reconstitution that are well known 

25 in the art. 

It is, however, contemplated to be within the scope of the present invention for 
immunoglobulin heavy and light chains, or variants or derivatives thereof, to be expressed 
in separate transgenic avians, and therefore isolated from separate media including serum or 
eggs, each isolate comprising a single species of immunoglobulin polypeptide. The method 

30 may further comprise the step of combining a plurality of isolated heterologous 

immunoglobulin polypeptides, thereby producing an antibody capable of selectively binding 
to an antigen. In this embodiment, two individual transgenic avians may be generated 
wherein one transgenic produces serum or eggs having an immunoglobulin heavy chain 
variable region, or a polypeptide comprising such, expressed therein. A second transgenic 

35 animal, having a second transgene, produces serum or eggs having an immunoglobulin light 
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chain variable region, or a polypeptide comprising such, expressed therein. The 
polypeptides may be isolated from their respective sera and eggs and combined in vitro to 
generate a binding site capable of binding an antigen. 

Examples of therapeutic antibodies that can be used in methods of the invention 
5 include but are not limited to HERCEPTIN® (Trastuzumab) (Genentech, CA) which is a 
humanized anti-HER2 monoclonal antibody for the treatment of patients with metastatic 
breast cancer; REOPRO® (abciximab) (Centocor) which is an anti-glycoprotein Eh/ma 
receptor on the platelets for the prevention of clot formation; ZENAPAX® (daclizumab) 
(Roche Pharmaceuticals, Switzerland) which is an immunosuppressive, humanized anti- 
1 0 CD25 monoclonal antibody for the prevention of acute renal allograft rejection; 

PANOREX™ which is a murine anti-17-IA cell surface antigen IgG2a antibody (Glaxo 
Wellcome/Centocor); BEC2 which is a murine anti-idiotype (GD3 epitope) IgG antibody 
(ImClone System); IMC-C225 which is a chimeric anti-EGFR IgG antibody (ImClone 
System); VITAXDST™ which is a humanized anti-aVp3 integrin antibody (Applied 
15 Molecular Evolution/Medlmmune); Campath 1H/LDP-03 which is a humanized anti CD52 
IgGl antibody (Leukosite); Smart M195 which is a humanized anti-CD33 IgG antibody 
(Protein Design Lab/Kanebo); RITUXAN™ which is a chimeric anti-CD20 IgGl antibody 
(IDEC Pharm/Genentech, Roche/Zettyaku); LYMPHOCIDE™ which is a humanized anti- 
CD22 IgG antibody (Immunomedics); ICM3 is a humanized anti-ICAM3 antibody (ICOS 
20 Pharm); IDEC-1 14 is a primatied anti-CD80 antibody (IDEC Pharm/Mitsubishi); 

ZEVALIN™ is a radiolabelled murine anti-CD20 antibody (DDEC/Schering AG); IDEC- 
131 is a humanized anti-CD40L antibody (BDEC/Eisai); IDEC-151 is a primatized anti-CD4 
antibody (IDEC); IDEC-1 52 is a primatized anti-CD23 antibody (IDEC/Seikagaku); 
SMART anti-CD3 is a humanized anti-CD3 IgG (Protein Design Lab); 5G1.1 is a 
25 humanized anti-complement factor 5 (C5) antibody (Alexion Pharm); D2E7 is a humanized 
anti-TNF-a antibody (CAT/BASF); CDP870 is a humanized anti-TNF-a Fab fragment 
(Celltech); IDEC-151 is a primatized anti-CD4 IgGl antibody (IDEC Pharm/SmithKline 
Beecham); MDX-CD4 is a human anti-CD4 IgG antibody (Medarex/Eisai/Genmab); 
CDP571 is a humanized anti-TNF-a IgG4 antibody (Celltech); LDP-02 is a humanized anti- 
30 tt 4p7 antibody (LeukoSite/Genentech); OrthoClone OKT4A is a humanized anti-CD4 IgG 
antibody (Ortho Biotech); ANTOVA™ is a humanized anti-CD40L IgG antibody (Biogen); 
ANTEGREN™ is a humanized anti-VLA-4 IgG antibody (Elan); and CAT-152 is a human 
anti-TGF-P 2 antibody (Cambridge Ab Tech). 
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5.3.2 PROTEIN RECOVERY 

The protein of the present invention may be produced in purified form by any known 
conventional technique. For example, chicken cells may be homogenized and centrifuged. 
The supernatant can then be subjected to sequential ammonium sulfate precipitation and 

5 heat treatment The fraction containing the protein of the present invention is subjected to 
gel filtration in an appropriately sized dextran or polyacrylamide column to separate the 
proteins. If necessary, the protein fraction may be further purified by HPLC. In another 
embodiment, an affinity column is used, wherein the protein is expressed with a tag. 

Accordingly, the invention provides proteins that are produced by transgenic avians 

10 of the invention. In a preferred embodiment, the protein is produced and isolated from an 
avian egg. In another embodiment, the protein is produced and isolated from avian serum. 

5.4 PHARMACEUTICAL COMPOSITIONS 

The present invention further provides pharmaceutical compositions, formulations, 

1 5 dosage units and methods of administration comprising the heterologous proteins produced 
by the transgenic avians using methods of the invneion. Preferably, compositions of the 
invention comprise a prophylactically or therapeutically effective amount of a the 
heterologous protein, and a pharmaceutical^ acceptable carrier. 

The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle with which a 

20 compound of the invention is administered. Such pharmaceutical vehicles can be liquids, 
such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, 
such as peanut oil, soybean oil, mineral oil, sesame oil and the like. The pharmaceutical 
vehicles can be saline, gum acacia, gelatin, starch paste, talc, keratin, colloidal silica, urea, 
and the like. In addition, auxiliary, stabilizing, thickening, lubricating and coloring agents 

25 may be used. When administered to a patient, the compounds of the invention and 
pharmaceutical^ acceptable vehicles are preferably sterile. Water is a preferred vehicle 
when the compound of the invention is administered intravenously. Saline solutions and 
aqueous dextrose and glycerol solutions can also be employed as liquid vehicles, 
particularly for injectable solutions. Suitable pharmaceutical vehicles also include 

30 excipients such as starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica 
gel, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, 
glycerol, propyleneglycol, water, ethanol and the like. The present compositions, if desired, 
can also contain minor amounts of wetting or emulsifying agents, or pH buffering agents. 
The present compositions can take the form of solutions, suspensions, emulsion, 

35 tablets, pills, pellets, capsules, capsules containing liquids, powders, sustained-release 
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formulations, suppositories, emulsions, aerosols, sprays, suspensions, or any other form 
suitable for use. In one embodiment, the pharmaceutical^ acceptable vehicle is a capsule 
(see e.g., U.S. Patent No. 5,698,155). Other examples of suitable pharmaceutical vehicles 
are described in "Remington: the Science and Practice of Pharmacy", 20th ed., by Mack 

5 Publishing Co. 2000. 

In a preferred embodiment, the heterologous proteins are formulated in accordance 
with routine procedures as a pharmaceutical composition adapted for intravenous 
administration to human beings. Typically, compounds of the invention for intravenous 
administration are solutions in sterile isotonic aqueous buffer. Where necessary, the 
10 compositions may also include a solubilizing agent. Compositions for intravenous 

administration may optionally include a local anesthetic such as lignocaine to ease pain at 
the site of the injection. Generally, the ingredients are supplied either separately or mixed 
together in unit dosage form, for example, as a dry lyophilized powder or water free 
concentrate in a hermetically sealed container such as an ampoule or sachette indicating the 
1 5 quantity of active agent. Where the heterologous protein of the invention is to be 
administered by infusion, it can be dispensed, for example, with an infusion bottle 
containing sterile pharmaceutical grade water or saline. Where the composition of the 
invention is administered by injection, an ampoule of sterile water for injection or saline can 
be provided so that the ingredients may be mixed prior to administration. 
20 Compositions for oral delivery may be in the form of tablets, lozenges, aqueous or 

oily suspensions, granules, powders, emulsions, capsules, syrups, or elixirs, for example. 
Orally administered compositions may contain one or more optional agents, for example, 
sweetening agents such as fructose, aspartame or saccharin; flavoring agents such as 
peppermint, oil of wintergreen, or cherry; coloring agents; and preserving agents, to provide 
25 a pharmaceutical^ palatable preparation. Moreover, where in tablet or pill form, the 
compositions may be coated to delay disintegration and absorption in the gastrointestinal 
tract thereby providing a sustained action over an extended period of time. Selectively 
permeable membranes surrounding an osmotically active driving compound are also 
suitable for orally administered compounds of the invention. In these later platforms, fluid 
30 from the environment surrounding the capsule is imbibed by the driving compound, which 
swells to displace the agent or agent composition through an aperture. These delivery 
platforms can provide an essentially zero order delivery profile as opposed to the spiked 
profiles of immediate release formulations. A time delay material such as glycerol 
monostearate or glycerol stearate may also be used. Oral compositions can include standard 



-59- 



WO 03/024199 PCT/US02/30156 



vehicles such as mannitol, lactose, starch, magnesium stearate, sodium saccharin, cellulose, 
magnesium carbonate, etc. Such vehicles are preferably of pharmaceutical grade. 

Further, the effect of the heterologous proteins may be delayed or prolonged by 
proper formulation. For example, a slowly soluble pellet of the compound may be prepared 

5 and incorporated in a tablet or capsule. The technique may be improved by making pellets 
of several different dissolution rates and filling capsules with a mixture of the pellets. 
Tablets or capsules may be coated with a film which resists dissolution for a predictable 
period of time. Even the parenteral preparations may be made long-acting, by dissolving or 
suspending the compound in oily or emulsified vehicles which allow it to disperse only 

10 slowly in the serum. 

5.5 TRANSGENIC AVIANS 

Another aspect of the present invention concerns transgenic avians, preferably 
chicken or quail, produced by methods of the invention described in section 5.1 infra, 

1 5 preferably by introducing a nucleic acid comprising a transgene into an avian oocyte by the 
sperm-mediated transfection methods of the present invention. In one embodiment, a 
heterologous nucleic acid introduced to an avian oocyte by sperm-mediated transfection, 
resulting in a transgenic embryo which is then allowed to develop, preferably, transferred 
into the reproductive tract of a recipient hen where it is encapsulated by natural egg white 

20 proteins and a natural egg shell, then it is incubated and hatched to produce a transgenic 
chick. The heterologous polypeptide or polypeptides encoded by the transgenic 
heterologous nucleic acid may be secreted into the oviduct lumen of the mature transgenic 
chicken and deposited as a constituent component of egg white. The resulting transgenic 
avian chick the GO) will carry one or more desired transgene(s) some or all of its cells, 

25 preferably in its germ line. These GO transgenic avians can be bred using methods well 
known in the art to generate second generation {i.e., Gls) transgenic avians that carry the 
transgene, i. e., achieve germline transmission of the transgene. In preferred embodiments, 
the methods of the invention result in germline transmission, i.e., percentage of GOs that 
transmit the transgene to progeny (Gls), that is greater than 5%, preferably, greater than 

30 10%, 20%, 30%, 40%, and, most preferably, greater than 50%, 60%, 70%, 80%, 90% or 
even 100%. In other embodiments, the efficiency of transgenesis (i.e., number of GOs 
containing the transgene) is greater than 2%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 
80% or 99%. 

The egg can be harvested after laying and before hatching of a chick, or further 
35 incubated to generate a cloned chick, optionally genetically modified. The cloned chick 
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may carry a transgene in all or most of its cells. After maturation, the transgenic avian may 
lay eggs that contain one or more desired heterologous protein(s). 

Hie cloned chick may also be a knock-in chick expressing an alternative phenotype 
or capable of laying eggs having an heterologous protein therein. The reconstructed egg 
5 ma y also be cultured to term using the ex ovo method described by Perry et al. {supra). 

Following maturation, the transgenic avian and/or transgenic progeny thereof, may 
lay eggs containing one or more desired heterologous protein(s) expressed therein and that 
can be easily harvested therefrom. The Gl chicks, when sexually mature, can then be bred 
to produce progeny that are homozygous or heterozygous for the transgene. 
10 A transgenic avian of the invention may contain at least one transgene, at least two 

transgenes, at least 3 transgenes, at least 4 transgenes, at least 5 transgenes, and preferably, 
though optionally, may express the subject nucleic acid encoding a polypeptide in one or 
more cells in the animal, such as the oviduct cells of the chicken. In embodiments of the 
present invention, the expression of the transgene may be restricted to specific subsets of 
15 cells, tissues, or developmental stages utilizing, for example, cis-acting sequences that 
control expression in the desired pattern. Toward this end, it is contemplated that tissue- 
specific regulatory sequences, or tissue-specific promoters, and conditional regulatory 
sequences may be used to control expression of the transgene in certain spatial patterns. 
Moreover, temporal patterns of expression can be provided by, for example, conditional 
20 recombination systems or prokaryotic transcriptional regulatory sequences. The inclusion 
of a 5' MAR region, and optionally the 3' MAR on either end of the sequence, in the 
expression cassettes suitable for use in the methods of the present invention may allow the 
heterologous expression unit to escape the chromosomal positional effect (CPE) and 
therefore be expressed at a more uniform level in transgenic tissues that received the 
25 transgene by a route other than through germ line cells. 

The transgenes may, in certain embodiments, be expressed conditionally, e.g., the 
heterologous protein coding sequence is under the control of an inducible promoter, such as 
a prokaryotic promoter or operator that requires a prokaryotic inducer protein to be 
activated. Operators present in prokaryotic cells have been extensively characterized in vivo 
30 and in vitro and can be readily manipulated to place them in any position upstream from or 
within a gene by standard techniques. Such operators comprise promoter regions and 
regions that specifically bind proteins such as activators and repressors. One example is the 
operator region of the lexA gene of E. coli to which the LexA polypeptide binds. Other 
exemplary prokaryotic regulatory sequences and the corresponding trans-activating 
35 prokaryotic proteins are disclosed by Brent and Ptashne in U.S. Patent No. 4,833,080 (the 
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contents of which is herein incorporated by reference in its entirety). Transgenic animals 
can be created which harbor the subject transgene under transcriptional control of a 
prokaryotic sequence or other activator sequence that is not appreciably activated by avian 
proteins. Breeding of this transgenic animal with another animal that is transgenic for the 
5 corresponding trans-activator can be used to activate of the expression of the transgene. . 
Moreover, expression of the conditional transgenes can also be induced by gene therapy-like 
methods wherein a gene encoding the trans-activating protein, e.g., a recombinase or a 
prokaryotic protein, is delivered to the tissue and caused to be expressed, such as in a cell- 
type specific manner. 

1 0 Transactivators in these inducible or repressible transcriptional regulation systems 

are designed to interact specifically with sequences engineered into the transgene. Such 
systems include those regulated by tetracycline ("tet systems"), interferon, estrogen, 
ecdysone, Lac operator, progesterone antagonist RU486, and rapamycin (FK506) with tet 
systems being particularly preferred (see, e.g., Gingrich and Roder, 1998, Annu. Rev. 

1 5 Neurosci. 21 : 377-405; incorporated herein by reference in its entirety). These drugs or 
hormones (or their analogs) act on modular transactivators composed of natural or mutant 
ligand-binding domains and intrinsic or extrinsic DNA binding and transcriptional 
activation domains. In certain embodiments, expression of the heterologous peptidecan be 
regulated by varying the concentration of the drug or hormone in medium in vitro or in the 

20 diet of the transgenic animal in vivo. 

In a preferred embodiment, the control elements of the tetracycline-resistance operon 
of E. coli is used as an inducible or repressible transactivator or transcriptional regulation 
system ("tet system") for conditional expression of the transgene. A tetracycline-controlled 
transactivator can require either the presence or absence of the antibiotic tetracycline, or one 

25 of its derivatives, e.g., doxycycline (dox), for binding to the tet operator of the tet system, 
and thus for the activation of the tet system promoter (Ptet). 

In a specific embodiment, a tetracycline-repressed regulatable system (TrRS) is used 
(Agha-Mohammadi and Lotze, 2000, J. Clin. Invest. 105(9): 1177-83; ShocketUra/., 1995, 
Proc. Natl. Acad. Sci. USA 92: 6522-26; and Gossen and Bujard, 1992, Proc. Natl. Acad. 

30 Sci. USA 89: 5547-51; incorporated herein by reference in their entireties). 

In another embodiment, a reverse tetracycline-controlled transactivator, e.g., rtTA2 
S-M2, is used. rtTA2 S-M2 transactivator has reduced basal activity in the absence 
doxycycline, increased stability in eukaryotic cells, and increased doxycycline sensitivity 
(Urlinger et al, 2000, Proc. Natl. Acad. Sci. USA 97(14): 7963-68; incorporated herein by 

35 reference in its entirety). In another embodiment, the tet-repressible system described by 
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Wells etal. (1999, Transgenic Res. 8(5): 371-81; incorporated herein by reference in its 
entirety) is used. In one aspect of the embodiment, a single plasmid Tet-repressible system 
is used.. In another embodiment, the GAL4-UAS system (Ornitz et al., 1991, Proc. Natl. 
Acad. Sci. USA 88:698-702; Rowitch etal, 1999, J. Neuroscience 19(20):8954-8965; 
5 Wang et al., 1999, Proc. Natl. Acad. Sci. USA 96:8483-8488; Lewandoski, 2001, Nature 
Reviews (Genetics) 2:743-755) or a GAL4-VP16 fusion protein system (Wang et al., 1999, 
Proc. Natl. Acad Sci. USA 96:8483-8488) is used. 

In other embodiments, conditional expression of atransgene is regulated by using a 
recombinase system that is used to turn on or off the gene's expression by recombination in 
10 the appropriate region of the genome in which the potential drug target gene is inserted. 
The transgene is flanked by recombinase sites, e.g., FRT sites. Such a recombinase system 
can be used to turn on or off expression a transgene (for review of temporal genetic 
switches and "tissue scissors" using recombinases, see Hennighausen & Furth, 1999, Nature 
Biotechnol. 17: 1062-63). Exclusive recombination in a selected cell type may be mediated 
15 by use of a site-specific recombinase such as Cre, FLP-wild type (wt), FLP-L or FLPe. 
Recombination may be effected by any art-known method, e.g., the method of Doetschman 
et al. (1987, Nature 330: 576-78; incorporated herein by reference in its entirety); the 
method of Thomas et al, (1986, Cell 44: 419-28; incorporated herein by reference in its 
entirety); the Cre-loxP recombination system (Sternberg and Hamilton, 1981, J. Mol. Biol. 
20 150: 467-86; Lakso et al, 1992, Proc. Natl. Acad Sci. USA 89: 6232-36; which are both 
incorporated herein by reference in their entireties); the FLP recombinase system of 
Saccharomyces cerevisiae (O'Gorman et al, 1991, Science 251: 1351-55); the Cre-loxP- 
tetracycline control switch (Gossen and Bujard, 1992, Proc. Natl. Acad Sci. USA 89: 5547- 
51, incorporated herein by reference in its entirety); and ligand-regulated recombinase 
25 system (Kellendonk et al, 1999, J. Mol Biol. 285: 175-82; incorporated herein by reference 
in its entirety). Preferably, the recombinase is highly active, e.g., the Cre-loxP or the FLPe 
system, and has enhanced thermostability (Rodriguez et al, 2000, Nature Genetics 25: 139- 
40; incorporated herein by reference in its entirety). 

In a specific embodiment, the ligand-regulated recombinase system of Kellendonk et 
30 al. (1999, J. Mol. Biol. 285: 175-82; incorporated herein by reference in its entirety) can be 
used. In this system, the ligand-binding domain (LBD) of a receptor, e.g., the progesterone 
or estrogen receptor, is fused to the Cre recombinase to increase specificity of the 
recombinase. 

In the case of an avian, a heterologous polypeptide or polypeptides encoded by the 
35 transgenic nucleic acid may be secreted into the oviduct lumen of the mature animal and 
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deposited as a constituent component of the egg white into eggs laid by the animal. It is 
also contemplated to be within the scope of the present invention for the heterologous 
polypeptides to be produced in the serum of a transgenic avian. 

A leaky promoter such as the CMV promoter may be operably linked to a transgene, 
5 resulting in expression of the transgene in all tissues of the transgenic avian, resulting in 
production of, for example, immunoglobulin polypeptides in the serum. Alternatively, the 
transgene may be operably linked to an avian promoter that may express the transgene in a 
restricted range of tissues such as, for example, oviduct cells and macrophages so that the 
heterologous protein may be identified in the egg white or the serum of a transgenic avian. 
1 0 Transgenic avians produced by the sperm-mediated transfection methods of the present 
invention will have the ability to lay eggs that contain one or more desired heterologous 
protein(s) or variant thereof. 

One embodiment of the present invention, therefore, is a transgenic avian produced 
by the sperm-mediated transfection methods of the present invention and having a 
15 heterologous polynucleotide sequence comprising a nucleic acid insert encoding a 

heterologous polypeptide and operably linked to an avian lysozyme gene expression control 
region, the gene expression control region comprising at least one 5' matrix attachment 
region, an intrinsically curved DNA region, at least one transcription enhancer, a negative 
regulatory element, at least one hormone responsive element, at least one avian CR1 repeat 
20 element, and a proximal lysozyme promoter and signal peptide-encoding region. 

Another embodiment of the present invention provides a transgenic avian further 
comprising a transgene with a lysozyme 3' domain. 

Accordingly, the invention provides transgenic avians produced by methods of the 
invention as described infra. In preferred embodiments, the transgenic avian contains a 
25 transgene comprising a heterologous peptide coding sequence operably linked to a promoter 
and, in certain embodiments, other regulatory elements. In more preferred embodiments, 
the transgenic avians of the invention produce heterologous proteins, preferably in a tissue 
specific manner, more preferably such that they are deposited in the serum and, most 
preferably, such that the heterologous protein is deposited into the egg, particularly in the 
30 egg white. In preferred embodiments, the transgenic avians produce eggs containing greater 
than 5 ug, 10 ug, 50 ug, 100 ug, 250 ug, 500 ug, or 750 ug, more preferably greater than 1 
mg, 2 mg, 5 mg, 10 mg, 20 mg, 50 mg, 100 mg, 200 mg, 500 mg, 700 mg, 1 gram, 2 grams, 
3 grams, 4 grams or 5 grams of the heterologous protein. In preferred embodiments, the 
transgenic avians produce an immunoglobulin molecule and deposit the immunoglobulin in 
35 the egg or serum of the avian, and preferably, the immunoglobulin isolated from the egg or 
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serum specifically binds its cognate antigen. The antibody so produced may bind the 
antigen with the same, greater or lesser affinity than the antibody produced in a mammalian 
cell, such as a myeloma or CHO cell. 

In specific embodiments, the transgenic avians of the invention were not produced 

5 or are not progeny of a transgenic ancestor produced using a eukaryotic viral vector, more 
particularly, not a retroviral vector (although, in certain embodiments, the vector may 
contain sequences derived from a eukaryotic viral vector, such as promoters, origins of 
replication, etc.). The transgenic avians of the invention include GO avians, founder 
transgenic avians, Gl transgenic avians, avians containing the transgene in the sperm or 

1 0 ova, avians mosaic for the transgene and avians containing copies of the transgene in most 
or all of the cells. Contemplated by the invention are transgenic avians in which the 
transgene is episomal. In more preferred embodiments, the transgenic avians have the 
transgene integrated into one or more chromosomes. Chromosomal integration can be 
detected using a variety of methods well known in the art, such as, but not limited to, 

15 Southern blotting, PCR, etc. 

6. EXAMPLES 

The present invention is further illustrated by the following examples. Each 
example is provided by way of explanation of the invention, and is not intended to be a 

20 limitation of the invention. In fact, it will be apparent to those skilled in the art that various 
modifications, combination, additions, deletions and variations can be made in the present 
invention without departing from the scope or spirit of the invention. For instance, features 
illustrated or described as part of one embodiment can be used in another embodiment to 
yield a still further embodiment It is intended that the present invention covers such 

25 modifications, combinations, additions, deletions and variations as come within the scope of 
the appended claims and their equivalents. 

All references cited herein are incorporated herein by reference in their entirety and 
for all purposes to the same extent as if each individual publication, patent or patent 
30 application was specifically and individually indicated to be incorporated by reference in its 
entirety for all purposes. The citation of any publication is for its disclosure prior to the 
filing date and should not be construed as an admission that the present invention is not 
entitled to antedate such publication by virtue of prior invention. 



35 
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6.1 Example 1 : Vectors Having Sperm-Specific Reporter Genes 

The specific activity of spermatogenesis-specific promoters, such as the protamine 
promoter necessary for post-meiotic-specific transcription of this gene may be used to 
selectively mark those sperm cells that have inherited the transgene of interest after meiotic 
5 segregation. 

The construct contains two separate elements. In one example, the first element 
comprises an oviduct-specific promoter, such as that associated with a gene encoding 
ovalbumin, lysozyme, ovomucoid, ovotransferrin, coiialbumin or ovomucin, The promoter 
is operatively linked to, and therefore drives Ihe expression of a gene coding for a desired 

1 0 heterologous protein of interest, such as, but not limited to, a therapeutic protein like 
interferon, erythropoietin (EPO), or an immunoglobulin. 

The second element, which can be located either upstream or downstream from the 
first element, contains the protamine promoter, or any fragment thereof that is sufficient to 
drive the expression of a marker gene encoding a vital and color marker, such as the Green 

15 Fluorescent Protein (GFP). Those sperm cells that incorporate the transgene into their 
genomic DNA are vitally labeled during Ihe late stages of spermiogenesis by the expression 
of the GFP protein. Given that the construct contains both the above first and the second 
elements, positive sperm cells also contain the transgene of interest. 

Large numbers of positive sperm cells expressing the GFP protein are isolated using 

20 Fluorescent Activated Cell Sorting (FACS). Sperm cells selected on the basis of the 
expression of the incorporated marker gene are then used to breed hens by artificial 
insemination protocols. Suitable avian insemination protocols have been described by 
Etches (1996) Reprod. in Poultry (CAB International, Wallingford, UK), incorporated 
herein by reference in its entirety. In those cases where the number of positive sperm 

25 obtained after FACS isolation is too low for the likelihood of successful artificial 

insemination, the females may be fertilized by the intramagnal insemination method of 
Engel (1991) Poult Sci. 70:1965-1969 or Trefil (1996) Br. Poult. Sci. 37:661-664, 
incorporated herein by reference in their entireties. Alternatively, small numbers of positive 
sperm cells are isolated under a microscope using UV light and then microinjected into 

30 unfertilized eggs via the Intracytoplasmic Sperm Injection (ICSI) protocols of Perry (1999), 
incorporated herein by reference in its entirety. 

6.2 Example 2: Lipofection Gene Transfer to Avian Oocytes 

(a) Isolation of the ovum: Donor hens were inseminated using the protocol for 
3 5 avian artificial insemination described by Etches (1 996), incorporated herein by reference in 
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its entirety. Fertilized ova were collected from the magnum region of the oviduct of 
euthanized birds 1 .5-3 hours after oviposition. Alternatively, a hen whose oviduct is 
fistulated allows the collection of eggs for enucleation as taught by Gilbert and Woodgush, 
(1963, J. Reprod Fertility 5: 451-453) and Pancer et al, (1989, Br. Poult. Sci. 30: 953-7). 
5 The thick albumen capsule surrounding the ovum was removed using spatulas and the ovum 
was placed in a well 48mm diameter and 23 mm in height containing Perry's salt solution 
(see Perry (1988), incorporated herein by reference in its entirety). 

(b) Preparation oflipofection solutions: Two lipofection solutions were used. The 
first solution comprised 50ug/ml of LEPOFECTAMINE™ (Gibco) pre-incubated for 1 hour 

10 with the restriction endonuclease Not I (500 Units Not I per ml oflipofection solution), and 
designated herein as «Lipofectamme/Not I solution". The second lipofection solution was 
composed of 50ug/ml of LIPOFECT AMINE™ pre-incubated for 1 hour with 500ug of 
peGFP linearized with Not I per ml of lipofection solution, herein described as 
"Lipofectamine/peGFP solution." Lipofectin-treated eggs were then incubated for 1 hour. 

15 (c) Gene transfer to avian oocytes by lipofection: The isolated ovum was then 

placed inside a glass conical chamber (Figure 1 A) so that the blastodisc was located in the 
center of a window that opens at the narrower end of the conical chamber. A 40 mm 
diameter and 8 mm high glass dish was used at the bottom of the cone to close the system. 
Perry salt solution was added to the bottom of the dish to prevent drying of the lower half of 

20 the ovum. The Perry's salt solution overlaying the blastodisc (accessed through the window 
opening of the cone) was then replaced by, for example, 100 ul, of a lipofection solution 
described below. The eggs were incubated for 1 hour. Alternatively, egg incubation can be 
done by adding the lipofection solutions to the well and inverting the position of the 
incubation chamber (Figure IB), or by using a cloning cylinder around the blastodisc 

25 (Figure 1C). 

(d) Transfer of the lipofected egg: In a preferred embodiment, the ovum is 
surgically transferred into the oviduct of the recipient hen shortly after lipofection according 
to a described surgical procedure. (Tanaka, 1994, supra). The recipient hens are 
anesthetized by wing vein injection with pentobarbital (0.7 ml of a 68 mg/ml solution) or 

30 using gas anesthetics such as Isoflurane shortly after laying. During this window, the 

infundibulum is receptive to receiving a donor ovum but that has not yet ovulated. Feathers 
are removed from the abdominal area, the area is scrubbed with betadine and rinsed with 
70% ethanol. The bird is placed in a supine position and a surgical drape is placed over the 
bird exposing the surgical area. An incision is made beginning at the junction of the sternal 

35 rib to the breastbone and running parallel to the breastbone. The length of the incision is 
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approximately 6cm. After cutting through the smooth muscle layers and the peritoneum, 
the infundibulum is located. The infundibulum is externalized and opened using gloved 
hands. The donor ovum is gently placed in the open infundibulum. Gravity facilitates the 
movement of the ovum through the infundibulum and into the anterior magnum. The 

5 internalized ovum is placed into the body cavity and the incision closed using interlocking 
stitches both for the smooth muscle layer and the skin. The recipient hen is returned to her 
cage and allowed to recover with free access to both feed and water. The hens resume 
normal activities after a post-operative recovery time of less than 45 minutes. Once 
transferred, the embryo develops inside the recipient hen and travels through the oviduct 

10 where it is encapsulated by natural egg white proteins and a natural eggshell. Eggs laid by 
the recipient hens are collected the next day, set, and incubated in a Jamesway incubator. 
The eggs hatch 21 days later. 

6.3 Example 3: Maintenance of Plasmid Linearization in the Remi 
15 Procedure 

A plasmid that is to be integrated into the genomic nucleic acid of a sperm is 
linearized by cleavage with a selected restriction endonuclease. The linearized nucleic acid 
is then dephosphorylated at the exposed 5' ends of the newly formed cohesive regions by 
alkaline phosphatase treatment. Suitable protocols for the alkaline phosphatase 

20 dephosphorylation of nucleic acids are disclosed, for example, by Sambrook et al, (supra), 
incorporated herein by reference in its entirety. 

While not wishing to be bound by any one theory, it is believed that 
dephosphorylated cohesive ends of the nucleic acid may hybridize to recircularize the 
cleaved plasmid. Dephosphorylation of the 5' termini, however, prevent a DNA ligase from 

25 covalently rejoining a 5' terminus to the adjacent 3' terminus, thereby preventing a stable 
circular plasmid molecule from reforming. The cohesive ends of the non-ligated 
circularized plasmid may dissociate within a sperm cell to give a linearized nucleic acid that 
may integrate into the sperm genomic DNA 

Alternatively, a circular plasmid having a heterologous nucleic acid that is to be 

30 integrated into the genomic nucleic acid of a sperm is digested with at least two different 
restriction endonucleases that generate a linearized plasmid having two non-cohesive ends, 
and wherein the desired transgenic element heterologous nucleic acid remains intact 
between Ihe new termini of the cleaved plasmid. The restriction endonucleases are selected 
to give dissimilar cohesive ends that cannot hybridize together to recircularize the cleaved 

35 plasmid. The linearized nucleic acid is then delivered to the sperm with both of the 
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restriction endonucleases used to cleave the plasmid. The restriction endonucleases may be 
delivered to the sperm sequentially or simultaneously and combined, or sequentially 
delivered, with the cleaved plasmid. 

It can be advantageous, depending upon the positions of the endonuclease cleavage 
5 sites within the plasmid relative to the desired transgene, to use two different endonucleases 
that produce hybridizable cohesive ends. In this case, the 5' termini may also be 
dephosphorylated with alkaline phosphatase as described above, to prevent religation and 
stabilization of the cleaved plasmid. 

10 6.4 Example 4: Methods for Determing the SV40 Ori Requirement in SMT 

To determine the requirement for the SV40 origin of replication in sperm-mediated 
transgenesis, 5 ug each of the plasmids pi 083 (with the CMV promoter controlling heavy 
chain tanscription) and pl086 (where the CMV promoter controls light chain transcription) 
were digested with Dra HI which excises the SV40 origin of replication from the pl083 
15 plasmid while retaining the SV40 origin of replication of the P 1086 plasmid. For 

comparison, 5 ug each of the plasmids pl083 and P 1086 were digested with the restriction 
endonuclease Mlu I that linearizes both plasmids while retaining the SV40 origin of 
replication in each of the respective plasmids. 

Digested plasmids were used to transfect sperm. In a polystyrene tube, Dra HI- 

~~ . , . - „i«o>- -ipoq /■« «f««>,-V> «7<>re nrlHedto 100 ul of OPTIMEM™ 

zu digestea piasrmas piuoo anu jjiuoj vy p& " a ~""v "~ c — VJ -- • 

medium (Life Technologies, Gaithersburg, MD) and 10 ug of LIPOFECTAMINE™ 
liposome (Life Technologies, Gaithersburg, MD). In a separate tube, 100 units of Dra HI 
restriction enzyme were added to 100 ul of OPTIMEM™ medium followed by 10 ug of 
LIPOFECTAMINE™. The tubes were incubated at room temperature for 30 minutes, then 

25 added to freshly collected semen containing 10 9 chicken sperm (approximately 300 ul of 
semen). The sperm, DNA-liposome, and restriction enzyme-liposome mixture was 
incubated at room temperature for 30 minutes. 

Two White Leghorn hens were then artificially inseminated with 250 ul each of the 
transfection mixture. Eggs were collected for 7 days starting on the second day after 

30 fertilization, and set for hatch. Two weeks after hatch, serum samples were collected and 
assayed for human monoclonal antibodies by ELISA. The results are shown in Figure 10, 
wherein wing band number 3932 is the control. 
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6 5 Example 5: Gamma hradiauon of Chicken Sperm 
togenouB.ltaearizedDNAcaobetattgra^intotogenomeofaRcimen.spenn 

to lioofection thereof with the transgenic nucleic acid. 
5 Woostere/rf.fonndWroosterspennir^atod^nGrays^ofga™ 

in ^ (M ^n.^ut43%r«dd m IferuU V .(1977,C^^^.C^UM37. 

radiation: 0, 1.3. 10, 15,and20Gy. A Uposomal complex wffl constat of lOug of 
ltoealta d DNA contoining a promoter (eg.. CMV, ovalbumin, .ysozy^e, ovomucotd 

and 10 ug of liPOFECT AMINE™ (Life Technologies, Oaithersbnrg, MD) wtil then 

analysis of blood DNA. 

6 6 Examples 6: Ovnm Transfer to a Laying Hen 

A, the time of laying, recipient hens are anesthetized by wtag vein injection wtth 
20 pentobarbita (0.7 ml of a 68 mg/ml solution) or by a gaseous anesthetic such as fcoflurane. 

axea andmea K aissc™l*cdwltobetodin.,andrm S edwith70%emanol. Thebtrdts 
^tasupmepoaition^asurg^d^lap^^^Wwlmte^^ 

25 ^sed. rhaJonisn^beghming^^onofmes^ribto^^ 
and nmning parallel to me breastbone. The length of me incision is approxtmately wo 
inches. j^^^^m^m^^^t^m*' 
tofcndibnlum is located, tte internum U externalized and opened ttsmg gloveri tands 

30 move into me infundibulum and into me anterior magnnm by *av„y feed The 

ovum is placed into the body cavity and me incision closed using mtorlockmg stitches bom 

np moving and feeding is nsusJly within 45 minutes of the operation's end. Eggs tad by 
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the recipient hens are collected the next day, set, and incubated. They will hatch 21 days 
later. 

6.7 Example 7: Generation of Transgenic Chickens by Sperm-Mediated 
5 Transf ection of Heterologous Nucleic Acid 

Plasmid pRC/CMV-EGFP, 10 jig, was added to 100 \i\ of OPTIMEM™ medium 
(Life Technologies, Gaithersburg, MD) and 10 ng of LIPOFECTAMINE™ (Life 
Technologies, Gaithersburg, MD) liposomes, in a polystyrene tube. In a separate tube, 100 
units of Dra HI restriction enzyme was added to 100 ng of OPTIMEM™ medium followed 

10 by 10 |ig of LIPOFECTAMINE™. As negative controls, plasmids pl086 and pl083 were 
used for pRC/CMV-EGFP in the transfection mixture. Tubes were incubated at room 
temperature for 30 minutes, then added to 10 9 freshly collected chicken sperm 
(approximately 300 \il of sperm). The sperm, DNA-liposome, and restriction enzyme- 
liposome mixture was incubated at room temperature for 30 minutes. 

1 5 Two White Leghorn hens were inseminated with the transfection mixture, each hen 

receiving approximately 250 [il of the transfection mixture. Eggs were collected for 7 days 
starting on the second day after fertilization, and set for hatch. 

Four days after hatching, blood drops from chicks were collected from leg veins 
with heparinized capillary tubes and placed on microscope slides. Blood smears were 

20 viewed with FITC illumination with an inverted microscope (Olympus 1X70, 1 00 watt 
mercury lamp, HQ-FITC Band Pass Emission filter cube, excitation 480/40 nm, emission 
535/50 nm, and 20X phase contrast objective). Auto-fluorescence was assessed using a 
TRITC filter (Olympus Modular B-MAX Filter cube, excitation 535/50 nm, emission 
610/75 nm). 

25 Two chicks that resulted from sperm transfected with pRC/CMV-EGFP had white 

blood cells showing green fluorescence. No fluorescence was seen when viewed with the 
TRITC filter, indicating that the green fluorescence was not due to auto-fluorescence. None 
of the control chicks, derived from sperm transferred with control plasmids, had green 
fluorescence in their blood. 

30 

6.8 Example 8: Sperm-Mediated Transfection of Japanese Quail Ova 

Prophetic Example 

Japanese Quail hens will be artificially inseminated with sperm transfected with 
vectors capable of expressing a-IFN, erythropoietin or a monoclonal antibody. ELISAs will 
35 be used to detect and measure the amount of an expressed transgene product in the animal's 
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serum and egg. As little as 15pg of a-interferon (a-IFN) or erythropoietin can be detected 
by this procedure. 

To prepare the quail flock for artificial insemination, the females will be separated 
from the males. Once the isolated females are no longer laying fertile eggs and the males 
5 are consistently producing sufficient semen, the birds will be used for artificial insemination 
(A.I) procedures. 

Sperm mediated transgenesis (SMT) of the quail will be performed with two 
plasmid vectors, pRC/CMV-IFNMM-SV40 and P RC/CMV-EPOMM-SV40. Transgenesis 
resulting in the integration of, and expression from, a heterologous nucleic acid encoding a- 

1 0 IFN has been used successfully in chickens with both viral-based and sperm-mediated 
transfer (SMT)-based systems. The second vector will carry the gene encoding for 
erythropoietin. This protein requires more extensive post-translational modification, i.e. 
four glycosylations, than does a-IFN. Both of the plasmid vectors will produce their 
respective expressed polypeptides in serum and in ovo. Assaying for a-IFN or EPO 

1 5 production in serum will begin at two weeks of age and egg production will occur shortly 
thereafter. SMT will be performed with vectors having immunoglobulin heavy and light 
chain under the expression control of a lysozyme promoter. 

About 50 chicks will be obtained from the SMT-AI.'s. Based on results from our 
chicken SMT experiments, at least 2 to 4 transgenic quail for every 50 birds will be 

20 produced from the SMT-AL's. 

6.9 Example 9: Preparation of Female and Male Japanese Quails for 
Sperm-Mediated Transfection by Artificial Insemination 

The birds used will be selected for their optimal age for fertility, according to the 
25 average life history of the quail as shown in Table 1 . 



Table 1: Japanese Quail life history 



Hatching 



16-17 days 



30 



Sexual maturity 

Females 



Males 



Under current conditions 
Under optimal conditions 



48 days 
35-38 days 
35-42 days 



35 



Optimal Fertility 

Females 



60-240 day (8-34 weeks) 
60-280 days (8-40 weeks) 



Males 
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Declining Fertility 

T^rtiKt y declines 30-50% 
Gamete production cessation 

Females (eggs) 
Males Decreases 




9?4 davs (32 weeks) 



20 



, x r>„ a i1 are separated from males and eggs inspected for 

Females- Female Japanese Quail are separaxeu uu in 

will be progressively selected. m,. .vrnaed miail speim will be 

from the animal, as reported by Holm. L. & Wisnart u 
a998) andincorpomtedhereinbyreferencemitsen^. ? 
L ,w8andabout9mamtainsmotilitybetterthandoesapHof7. 
between about 8 and about 9 ^ a25|ll dose 

Artificial insemination (A.I). bacnnenwuiu* 
25 li^^SxlO'^perben-HenswiUtadividedintoGroupl^temale, 

Mntam mg 2.5 x 10 sp p toseotated with semen treated with 

inseminated with semen only; Group 2. 4 temales ins oCMV-IFN- 
UPOFECTAMmE-lGronpS^^es^^ed^™^™ 

SV40; Oroup 4: 4 females sperm-nredUted — ^ L hens will 

remaining eggs will be incubated to hatching. 105 o F _il0<>Fforthe 
A) IfefcUtae «w. Newly hatched chicks will be grown at 105 t * 
(&; HrtcMif* y first and second week 

first 4-5 days. The temperature will then be reduced oy 
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in 2° F increments. By the third week the house temperature will be sufficient A 16/8 
lighting schedule will also be used. 

6.10 Example 10: Quail Semen Collection 

5 The male bird is grasped so its breastbone rests in the palm of the right hand. The 

tail is positioned so the first two fingers of right hand lay on either side of the vent just 
below the legs. Holding the male in an almost vertical position, the left hand gently 
squeezes four times at the base of the cloaca to remove the foamy secretions of the glandula 
proctodealis (foam gland). The vent is wiped to remove traces of the foamy substance and 

10 to prevent contamination of the semen. The left hand maintains firm pressure against the 
base of the cloaca and gently pulls back on dorsal proctodeal wall to achieve erection. 

The first two fingers of the right hand gently massage the abdomen and apply 
moderate pressure just below the vent to force semen from the vas deferens into the 
copulatory organ. The semen will appear shortly thereafter. The viscous, pale yellow to 

1 5 white semen is collected with a 20 ^1 pipette and immediately diluted with 1 50 mM NaCl 
and 20mM N-trisPydroxymethyl]me%l-2-aminoethane-sulfonic acid (TES), at pH 8.0. 

6.11 Example 11: Lipofection of Quail Sperm 

Quail semen will be diluted, immediately after harvesting, to a concentration of 10 8 
20 sperm/ml in 150 mM NaCl and 20mM N-tris[Hydroxymethyl]me1hYl-2-a 

sulfonic acid (TES), pH 8.0 buffer. Semen extender that is optimized for chicken sperm 
may not be used since it rapidly immobilizes quail semen within five minutes of contact. 

The lipofection procedures used with quail sperm will be similar to those adopted 
for chicken lipofection, including REMI sperm mediated tranfections (SMT). With the 
25 chicken SMT procedure, artificial insemination is with approximately 6 x 10 8 sperm. Due 
to the limited amount of semen produced by male quail 1x10 s quail sperm will be used per 
hen. The lowest number of sperm that will still gives maximum insemination will be 
adopted. Typically, the DNA (1.0|ag), restriction enzyme, LIPOFECTAMINE™ (l.O^ig) 
and sperm (10 8 ) will be incubated together at a ratio oil respectively for 30 minutes. All 
30 reactions will be carried out in OptiMEM™ medium (Gibco-BRL, Gaithersburg, MD). 

6.12 Integration of Adeno- Associated Virus (Aav) Inverted Terminal 
Repeats-Flanked Genes Introduced by Sperm-Mediated Transgenesis 

The chromosomal integration of plasmid DNA into the genome of an avian cell will 
35 be mediated by flanking the gene of interest and sequences related to its expression, with 
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AAV inverted terminal repeat (ITR) sequences. A method for gene delivery and integration 
of heterologous nucleic acid sequences into the genomic DNA of a mammalian cell is 
described by Solis et al in U.S. Patent Serial No. 5,843,742 incorporated herein by 
reference in its entirety. A nucleic acid segment will also be included with the gene of 

5 interest that will result in the expression of the AAV Rep protein within the same cell. 
For example, a plasmid nucleic acid vector containing an expression cassette 
consisting of a CMV immediate early promoter driving the expression of human 
erythropoetin, will be flanked by AAV ITR sequences. This plasmid will be introduced by 
sperm-mediate transgenesis into targeted host cells together with a second nucleic acid 

10 vector plasmid. This second plasmid will include an expression cassette comprising the 
CMV immediate early promoter driving expression of the nucleic acid sequence encoding 
the AAV Rep 78 protein. Alternatively, a single nucleic acid vector comprising the 
expression cassette comprising the CMV immediate early promoter driving expression of 
the nucleic acid sequence encoding the AAV Rep 78 protein and the cassette expressing the 

1 5 gene of interest, such as erythropoetin, will be introduced together into an avian male gem 
cell. 

6,13 Example 13: DNA Construct Modification to Improve Germline 
Transmission of Trangenes 

20 Following genetic modification in vertebrates, a low percentage of offsprings 

derived from the founder animals are transgenic given the low number of germline cells that 
carry the transgene. As a result, costly and cumbersome breeding of the founder animals is 
required to expand the number of transgenic animals derived from the original founder 
animals. 

25 Anumber of articles (e.g., Peschon, 1989, Ann. NYAcadSci. 564: 186-197; 

Peschon, 1987, PNAS 84: 5316-5319; Zambrowicz, 1993, PNAS 90: 5071; Braun, 1989, 
Gene Dev. 3:793-802; Rhim, 1995, Biol. Reprod. 52:20-32) as well as patent application^) 
(O'Gorman et al, PCT Publication No. WO 99/10488) have identified and used the 
elements of the protamine promoter necessary for post-meiotic-specific transcription of this 

30 gene. Other spermiogenesis-specific promoters have also been described and used in the 
context of genetic manipulation (Sage, 1999, Mech Dev. 80: 29-39; Vidal, 1998, MoL 
Reprod. Dev. 51 : 274-280). In this example, we take advantage of the specific activity of 
these promoters to selectively mark those sperm cells that have inherited the transgene of 
interest after meiotic segregation. 

35 
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In the example described here, the construct would contain two independent 
elements. In a preferred example, the first element would comprise an oviduct-specie 
promoter, such as ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and 
ovomucin. The promoter would drive expression of a gene coding for a protein of interest, 
5 such as a therapeutic protein like Interferon, erythroprotin (EPO). Alternatively, 
constitutive promoters such as CMV or RSV may also be used. 

The second element, located up or downstream from the first, would contaui the 
protamine promoter, or a segment of this promoter that is sufficient to drive the expression 
of a marker gene. In a preferred example, the protamine promoter would drive the 
10 expression of amarker, preferably a vital and color marker, such as the Green Fluorescent 
Protein (GFP). In such example, those sperm cells that have inherited the transgene would 
be vitally labeled during the late stages of spermiogenesis with the expression of the GFP 
protein. Given that the construct used contains both the first and the second elements 
described above, positive sperm cells would also contain the transgene of interest. 
15 Large numbers of positive sperm cells expressing the GFP proteins could be isolated 

using Fluorescent Activated Cell Sorting (FACS). These sperm cells could subsequently be 
used to breed hens by described artificial msemination protocols. (Etches, 1996, Mol. 
ReprodDev 45:2918). In cases where the number of positive sperm after FACS isolation 
is low and insufficient for AI, me females could be bred through intramagnal insemination. 
20 (Engel, 1991, Poultry Sci. -70: 1965; Trefil, 1996, Br. Poult. Sci. 37: 661-664). 

Alternatively, small numbers of positive sperm cells could be isolated under a microscope 
using UV light and injected into unfertilized eggs via described Intracytoplasmic Sperm 
Injection aCSI) protocols. (Perry, 1999, Science 284: 1180-83). 

25 6.14 Example 14: Use of Chicken Centromeric and Telomeric Sequences to 

Create a Chicken Artificial Chromosome (ChAQ 

The Shemesh et al. procedure (2000, Molecular Reproduction and Development 56: 
306-308) for introducing linearized plasmid DNA into chicken sperm appears to rely on 
vector sequences which include an SV40 origin of replication. It is possible that the 

30 exogenous DNA therefore replicates as an episome and would most likely be lost m 
subsequent cell divisions due to improper segregation at mitosis. To insure proper 
segregation at mitosis, chicken centromere and telomere sequences could be included in the 
transgenic construct. Chicken centromere and telomere sequences could be obtained on a 
BAC (bacterial artificial chromosome) library clone from Texas A&M University or Martin 

35 Groenen at Wageningen Agricultural University, The Netherlands. The SV40 origin of 
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conalb™to,aodovomucm,=<c.)a»dt M1S ge M (.. e . IFN.EPO human 

Ivy and Ugh. chains, OM-CSF, etc.) combination couid be oloned mti, the BAC 
heavy ana ugnr Tki.nArwouldtberefbrecontainanonginof 

telomeres, the construct would replicate ana segregate 
(ChAC). 

,0 6.15 Example 15: Construction of Lyaoryme Promoter Plaamid, 

ThecWekenlyao^egeneex^conh.lreg.on^.so^byPW 

^cation ^--^^SSSST— 

in an avian cell. amplified using the 

White Leghorn Chicken (Ga/fctf g« gnomic DNA was amp 
<; T MAM f SEQ ID NO" l)andLE-6.1kbrevl (SEQ ID NO: 2) in a first reaction, 

and Lys-6.1 (SEQ ID NU. 3; y for j 

^eregel purified, and then united in a third PCR reaction using only 5pLMAR2 
rCLvSlrev(SBQm N 0:4)asprime K a^nlO-mJnure^rWThe 

Entire of me ve^rPB^CB^KS.re^tingmureplasm.dW 

pl2.0-lys was used as a template in a PCR reaction w* prnnera 5pLMAR2 (SEQ 

Cccc:™ 

30 m^on time, Tbe rating D NA was 

into the EcoK V restriction site of pBUOTT® KS, fomung plaannd pUOlys B^ 

nUO-lvs-LSPm^waamgesttd^mSo/landtheSaBtoNoflpnmerCS- 
35 CS^-BMSEQ^NOa^am^mmed^plasmm, Mowed 
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by Not I digestion. The resulting 12.5 kb Not I fragment, comprising the lysozyme promoter 
region linked to IF>MAGMAX-encoding region and an SV40 polyadenylation signal 
sequence, was gel-purified and ligated to Not I cleaved and dephosphorylated 
PBLUESCRPT® KS, thereby forming the plasmid pAVIJCR-Al 15.93.1 .2, which was then 
5 sequenced. 

6.16 Example 16: Construction of Plasmids Which Contain the 3 ! Lysozyme 
Domain 

The plasmid pAVIJCR-Al 15.93.1.2 (containing the -12.0 kb lysozyme promoter 

10 controlling expression of human interferon a2b) was purified with a QIAGEN® Plasmid 
Maxi Kit (QIAGEN®, Valencia, CA), and 100 ng of the plasmid were restriction digested 
with Not\ restriction enzyme. The digested DNA was phenol/CHCl 3 extracted and ethanol 
precipitated. Recovered DNA was resuspended in ImM Tris-HCl (pH 8.0) and O.lmM 
EDTA, then placed overnight at 4°C. DNA was quantified by spectrophotometry and 

15 diluted to the appropriate concentration. The DNA samples were bound to the SV40 T 
antigen NLS peptide by incubation for 15 minutes. 

The plasmid pAVIJCR-Al 15.93.1.2 was restriction digested with Fsel and blunt- 
ended with T4 DNA polymerase. The linearized, blunt-ended pAVIJCR-Al 1 5.93 . 1 .2 
plasmid was then digested vAihXhol restriction enzyme, followed by treatment with 

20 alkaline phosphatase. The resulting 15.4 kb DNA band containing the lysozyme 5' matrix 
attachment region (MAR) and -12.0 kb lysozyme promoter driving expression of a human 
interferon was gel purified by electroelution. 

The plasmid plllilys was restriction digested with Mwl, then blunt-ended with the 
Klenow fragment of DNA polymerase. The linearized, blunt-ended plllilys plasmid was 

25 digested with Xhol restriction enzyme and the resulting 6 kb band containing the 3' 
lysozyme domain from exon 3 to the 3' end of the 3' MAR was gel purified by 
electroelution. The 15.4kbbandfrompAVIJCR-A115.93.1.2andthe6kbbandfrom 
pTTTilys were ligated with T4 DNA ligase and transformed into STBL4 cells (Invitrogen Life 
Technologies, Carlsbad, CA) by electroporation. The resulting 21.3 kb plasmids from two 

30 different bacterial colonies were named pAVIJCR-A212.89.2. 1 and pAVUCR-A212.89.2.3 
respectively. 



35 



78 



WO 03/024199 



PCTAJS02/30156 



6.17 Example 17: Cretan of an ALV-based Vector Having p-lactamase 
Encoding Sequences _ « A 

The lacZ get* of pNLB, a r^ennon-deficien. avian leukosis virus 

5 cona Jng of a cytomegalovirus (CMV) promoter and me reporter gene fib*— ^ 
M M> To efficiently replace the UkZ gene ofpNLB with a ttanagene, an 

te ohewedhaoa^paIfe 9 nea t ofp N LB(Co SS et e ,a;.,199M. M . 65.3388-94) 
W t S ft. y IpA ^ reside 289 bpupshnam of tocZanddte J - *d sues resrde 3 of 

) The fdled-in IM»U fragment of pCMV-BL (Moore « * Ami B.ochm. 247. 203 

W7)) ™i I ^i 0 »d.eel^-ba*^*I^''^^^ f 
^ltheCMVpro m o K ra^ tt te M gene(inp^B,^.re 5 ides«7b P u P «f 

tS/acZand^IresidealOOb^up^anrof^laeZs^eodon^erebyc^gP^^ 

Adap^-Cl^-BL.To CT ea te pN M ^MV-BL>«ffl^Iin^°fP^(-« 

rites ofpNLB yielded mostly rearranged subclones, for unknown reasons. 

20 

6.18 Ea»mpl«18:Pr.d..«...fTr»nad«ctionPartlc.e S H»vi.g.»ALV- 

baaed Vector Having p-lactamase Encoding Sequences 

Sen to aadMdeswe re ad ta redinF10(OBCO<l>) ; 5%newbomcalfsen ! m 

(GBCO®), 1% cbicken serum (GIBCO®), 50 pg/ml pbleomycin (Cayla Ubomtones) and 
25 Opg ta lbygromycin(SIGMA®). Traduction particles were produced as desenbedm 
3 « I 1991, herein bcotporated * ref — ■ ™* *<= '""o^ 8 «*— 

L 9 x 10> Semas, virus was harveried in bash media for 6-16 boors and filtemd. All of 
,i em ediawasused,obansduce3 x l* Isoldes hruuee 100 ™p"^ 
30 Thefollowingdnymemed.awasrepU.dw.m 

(SIGMA®). After 10-12 days, single 0418' colonies were isolated and unnsferred to 24- 
wellpla*, After 7-10 days, dters from each colony was de«rnnned by tmnaducftono 
Semes followed by 0418 selecuon. Typically 2 on. of 60 colonies gave M at 1-3 x . 
35 Tho S acolome S wereexpanded m dmevirusconcenna«edb ) 2-7xlO aadescnbedm 
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Allioli et al, 1994, Dev. Biol 165:30-7, herein incorporated by reference. The integrity of 
the CMV-BL expression cassette was confirmed by assaying for p-lactamase in the media of 
cells transduced with NLB-CMV-BL transduction particles. 

5 6.19 Example 19: pNLB-CMV-IFN Vector Having an IFN Encoding 

Sequence 

The DNA sequence for human interferon <x2b based on hen oviduct optimized codon 
usage was created using the B ACKTRANSLATE program of the Wisconsin Package, 
version 9.1 (Genetics Computer Group. Inc., Madison, WI) with a codon usage table 
10 compiled from the chicken (Gallus gallus) ovalbumin, lyso2yme, ovomucoid, and 

ovotransferrin proteins. The template and primer oligonucleotides (SEQ ID NOS: 14-31) 
shown in Figures 8 A-B were amplified by PCR with Pfu polymerase (STRATAGENE®, La 
Jolla, CA) using 20 cycles of 94°C for 1 min., 50°C for 30 sec, and 72°C for 1 min. and 10 
sec. 

1 5 PCR products were purified from a 1 2% polyacrylamide-TBE gel by the "crush and 

soak" method (Maniatis et al 1982), then combined as templates in an amplification 
reaction using only EFN-1 (SEQ ID NO: 21) and DFN-8 (SEQ ID NO: 3 1) as primers. The 
resulting PCR product was digested with Hind EI and Xba I and gel purified from a 2% 
agarose-TAE gel, then ligated into Hind HI and Xba I digested, alkaline phosphatase-treated, 

20 PBLUESCRIPT® KS (STRATAGENE®), resulting in the plasmid pBluKSP-IFNMagMax. 
Both strands were sequenced by cycle sequencing on an ABI PRISM 377 DNA Sequencer 
(Perkin-Elmer, Foster City, CA) using universal T7 or T3 primers. Mutations in pBluKSP- 
IFN derived from the original oligonucleotide templates were corrected by site-directed 
mutagenesis with the Transformer Site-Directed Mutagenesis Kit (Clontech, Palo Alto, 

25 CA). The interferon coding sequence was then removed from the corrected pBluKSP-IFN 
with Hind JR and Xba 1, purified from a 0.8% agarose-TAE Gel, and ligated to Hind HI and 
Xba I digested, alkaline phosphatase-treated pCMV-BetaLa-3B-dH. The resulting plasmid 
was pCMV-IFN which contained IFN coding sequence controlled by the cytomegalovirus 
immediate early promoter/enhancer and SV40 polyA site. 

30 To clone the IFN coding sequence controlled by the CMV promoter/enhancer into 

the NLB retroviral plasmid, pCMV-IFN was first digested with CM andXbal, then both 
ends were filled in with Klenow fragment of DNA polymerase (New England BioLabs, 
Beverly, MA). pNLB-adapter was digested with Nde I and Kpn I, and both ends were made 
blunt by T4 DNA polymerase (New England BioLabs). Appropriate DNA fragments were 

35 
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resulting plasmid was pNLB-adapter-CMV-IFN. 

digest of pNLB-CMV-EGFP, creating pNLB-CMV-lFN. 

«0 Exampl.20: Production . f pNlB^MV-IFNTra,. S d«c«..Parflel« 

!0 Sena packaging cells (Coss*. e, aL, 1991) were plated a, a density of 3 x 10^ 

lIwLnLeered24hane r pla to gwin 1 2 M ofC S Cl. P urt a ed P NLM^-ffND r ,A 

cellsw nans . ,. « n ife Technologies) in a final volume of 500 ul Opttmem 
15 and 6 pi ofLipofecta liposomes (Life Technologies) in 

^Technologies). The plams were gently rocked for four hours at 37 Cina5/.C 0l 
^^orLLu^m^wnsremoved.wnshedoncewimlrmofOpema^ 

^mlhygromycfe-SO.gWphl^voi, Tnenexrday.memumfiomhnnaf^ 
20 Sentaswnsr^veredaKifilteredthrougha0.45micronfilter. 

Tins medium was dien used ro tratsduca Isolde cell, 03 ml of me fiiteredmedrum 
^varedfi^S^c d lswna^ to 9.6nn„fF-10(MfeTechnclog i e S )^lem^ i 
TLcnhed ahove, in addition ro po.yhrene (SIGMA®) a, a final conconhnhon of 4 pg/ml. 

200 pg/ml neomycin <04 18 , SIGMA®). Every odier day, the medium was replaced wtm 

i j c ft o/„ palf serum 1% chicken serum, 50 ^g/ml 
fresh F-10 medium supplemented with 50 A call serum, 

30 ^m UsO^mlphlcomycmand^O^neornycirr ^ 

holmes were vab.ehy^and^wempicWandp^nr.Jv^ 

When some of the 24 weU dishes became confluent, medium was harvested and tttared to 
detenmnethecelllmeawimttoUghestproducfionofrenovirua. 

T heringwa S perfonnedhyp 1 anng 7 .5xlO<Sen a cell S perweUm24w.UpIa«son 

35 tedaypnortovMharvestandtransducuon. ThenextdaylmloffreshF-lOmedmm 
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supplemented with 50% calf serum, 1% chicken serum, 50 /ig/ml hygromycin, and 50 
^g/ml phleomycin was added to each well of the isolated Isolde colonies. Virus was 
harvested for 8-10 hours. The relative density of each well of Isoldes was noted. After8-10 
hours, 2 and 20^1 of media from each well of Isoldes was added directly to the media of 
5 duplicate wills of the Sentas. Harvested medium was also tested for Ihe presence of 
interferon by IFN ELISA and for interferon bioreactivity. The next day the media was 
replaced with F-10 medium supplemented with 50% calf serum, 1% chicken serum, 50 
//g/ml hygromycin, 50 //g/ml phleomycin, and 200 Mg/ml neomycin. When obvious 
neomycin-resistant colonies were evident in the wells of transduced Sentas, the number of 

10 colonies was counted for each well. 

The Isolde colony producing the highest titer was determined by taking into account 
the number of colonies and correcting for the density of the Isolde cells when the viral 
particles were harvested (i.e., if two Isolde colonies gave rise to media with the same titer, 
but one was at a 5% density and the other was at a 50% density at the time of viral harvest, 

1 5 the one at the 5% density was chosen for further work, as was the case in the present 
example). 

The Isolde cell line producing 1he highest titer of IFN-encoding transducing particles 
was scaled up to six T-75 tissue culture flasks. When flasks were confluent, cells were 
washed with F-10 medium (unsupplemented) and transducing particles were then harvested 

20 for 16 hours in 14 ml/flask of F-10 containing 1% calf serum (Atlanta Biologicals) and 0.2% 
chicken serum (Life Technolocyies). Medium was harvested, filtered through a 0.45 micron 
syringe filter, then centrifuged at 195,000xg in a Beckman 60Ti rotor for 35 min. Liquid 
was removed except for 1 ml, and this was incubated with the pellet at 37°C with gentle 
shaking for one hour. Aliquots were frozen at -70°C. Transducing particles were then 

25 titered on Senta cells to determine concentrations used to inject avian sperms. 

6.21 Example 21: Construction of Lysozyme Promoter Plasmids 

The chicken lysozyme gene expression control region isolated by PCR amplification 
is fully disclosed in U.S. Patent Application Serial No. 09/922,549, filed August 3, 2001 

30 and incorporated herein by reference in its entirety. Ligation and reamphfication of the 
fragments thereby obtained yielded a functionally contiguous nucleic acid construct 
comprising the chicken lysozyme gene expression control region operably linked to a 
nucleic acid sequence encoding a human interferon o2b polypeptide and optimized for 
codon usage in the chicken. Briefly, chicken (Gallus gallus (White Leghorn)) genomic 

35 DNA was PCR amplified using the primers 5pLMAR2 and LE-6.1kbrevl in a first 
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fcr 6 minutes, for 30 cycles .sing TAQ PLUS PRECISION™ DNA polymer 
(STRATAGENE®, La Joila, CA). The PGR products from these <wo reactor* were gel 
5 Led.ardmenum.edurarWPCRreac.ionasiagon.ySpLMAia 

U^ed.andcloned^Ae^RVresWctionsi.e of the vector rBLUESCMW® KS, 

resulting in the plasmidpl2.0-lys. rwAT>0 ,i 

pl 2.0-lys was used as a template in a PGR reaction with primers 5pLMAR2 and 
!0 LYSBSUaudalOuuuuKex^ouume.TheresuldugDNAwaspl.osphorylaMgel- 

""'"'Til' Olys-B was restriction digested with M end fto36 L gel-purified, and cloned 

15 .12 0-lys-LSPIFNMM was digested with &/ 1 and the SalltoNotl primer was pealed to 
L digested plasmid, followed by Not I digestion. The resulting 12.5 hbWIfragmam, 

dephosphorylated pBluesoipt. KS, thereby forming the plasnnd pAVIJCR-Al 15.93.1.2. 

20 

622 Example 22: Complete Lysozyme Promoter and IFNMA.GMAX 

Sequences . . 

The complete sequences of me lysozyme gene promoter and the codonoptumzed 
human interferon <r2b nucleic acid ate fully disclosed in U.S. Patent Application 
25 09/922,549, filed 03 August 2001 ami incorporated herein b, reference m tts entirety The 
complete nucleotide sequencu of*, approximately 12.5 kb chicken ^zyme promoter 

JlysozymesigualpepfidMoftesequenceeuc^gtegeuel^GMAXandme 

subiuen. polyadenylafion signal sequence. The IFNMAGMAX nuderc actd ttequenoe 
30 wbl.synuteaizedasdescnbedmExamp.e21 above. The expressed IFN a2b sequence 
within plasmid pAVUCR-A115.93.l-2 functioned as a reporter gene for lysozyme promoter 
activity. This plasmid construct may also be used for production of interferon u2b m the 
egg white of transgenic chickens. 



35 
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6.23 Example 23: Synthesis of the MDOT promoter construct 

Amplification of the ovomucoid and ovotransferrin promoter sequences 

Oligonucleotide primers 1 (SEQ ID NO: 32) and 2 (SEQ ID NO: 33), as shown in 
Figure 9 were used to amplify me ovomucoid sequences. Oligonucleotide primers 3 (SEQ 
5 ID NO: 34) and 4 (SEQ ID NO: 35) were used to amplify the ovotransferrin sequence by 
PCR. The primers were designed such that the PCR-amplified ovomucoid sequences 
contained an Xho I restriction cleavage site at the 5 ' end and a Cla I site at the 3 ' end. 
Similarly, the PCR-amplified ovotransferrin product had a Cla I restriction site at the 5' end 
and a Hind. EE site at the 3' end. The overlapping Cla I site was used to splice the two-PCR 
10 products to create the MDOT promoter construct. The nucleic acid sequence SEQ ID NO: 
1 1 of the MDOT promoter construct is shown in Figure 1 1 . The final product was cloned m 
a bluescript vector between the Xho I and Hind m sites. From the bluescript vector the 
promoter region was released by Kpn VMnd HI restriction digestion and cloned into the prc- 
CMV-IFN vector to replace the CMV promoter to create MDOT-IFN (clone #10). This 
1 5 plasmid was tested in vitro. 

624 Example 24: Testicular Injection 

5 weeks old White Leghorn male chickens were anesthetized using Isoflourane. 
Small incision was made between the last two ribs to expose the testes. A 5-10 pi virus 
20 suspension of pENHX^MV42GFP/VSVg (9 x 10* per ml) was injected into either both 

testes or only one of the testes. 

At 20 weeks of age, semen samples were collected. Only one bird had sperm in his 
semen Genomic DNA was isolated from the semen and used to amplify the transgene 
(CMV-EGFP) by PCR reaction using different DMSO concentrations. The samples were 
25 separated on agarose gel, transferred onto nitrocellulose membrane and hybridized with 
EGFP probe. As shown in Figure 1 1, EGFP positive bands are detected at two different 
DMSO concentrations suggesting that (1) specific PCR conditions are required for the 
amplification of the transgene and (2) the sperm samples have incorporated the ixansgene in 
their genome. 

30 

F QTTTVALENTS 

Reference now will be made in detail to the various embodiments of the invention, 
one or more examples of which are illustrated in the accompanying drawings. Each 
example is provided by way of explanation of the invention, not limitation of the invention. 
35 In fact, it will be apparent to those skilled in the art that various modifications, 
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combinations, additions, deletions and variations can be made in the present invention 
without departing from the scope or spirit of the invention. For instance, features illustrated 
or described as part of one embodiment can be used in another embodiment to yield a still 
further embodiment. It is intended that the present invention covers such modifications, 
5 combinations, additions, deletions and variations as fall within the scope of the appended 
claims and their equivalents. 
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What is Claimed Is : 

1 . A method of generating a transgenic avian zygote by sperm-mediated transfection, 
said method comprising: 

(a) obtaining a suspension of avian male germ cells selected from the group 
5 consisting of spermatozoa and spermatozoal precursor cells; 

(b) introducing a nucleic acid comprising a transgene comprising a nucleotide 
sequence encoding a heterologous polypeptide to the avian male germ cells 
by lipofection, electroporation or restriction enzyme mediated integration; 

(c) delivering the avian male germ cells having the nucleic acid to an avian 
10 oocyte, 

thereby generating a transgenic avian zygote having the nucleic acid incorporated therein. 



2. The method of Claim 1, wherein the avian male germ cells and the avian oocyte are 
obtained from a chicken. 

15 

3 . The method of Claim 1 , wherein the avian male germ cells and the avian oocytes are 
obtained from a quail. 

4. The method of Claim 1 , wherein the nucleotide sequence encoding said 

20 heterologous polypeptide is operably linked to a transcriptional regulatory element that can 
direct gene expression in one or more cells of said transgenic avian. 

5. The method of Claim 4, wherein the transcriptional regulatory element is selected 
from the group consisting of the promoter regions of the avian genes encoding ovalbumin, 

25 lysozyme, ovomucoid, ovomucin, conalbumin and ovotransferrin. 



6. The method of Claim 5, wherein the selected nucleic acid further comprises a 
chicken lysozyme gene expression controlling region comprising the nucleotide sequence of 
SEQIDNO: 7. 

30 

7. The method of Claim 4, wherein the transcriptional regulatory element is a tissue 
specific promoter. 

8. The method of Claim 7, wherein the tissue specific promoter is specific for the 
35 magnum. 
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9. The method of Claim 1, wherein the transgene comprises at least one 
cytomegalovirus promoter. 

10 The method of Claim 9, whereinthe transcriptional regulatory element comprises at 
5 least two regions derived from the promoter of an avian gene, said regions bemg from a 
different promoter. 

11. The method of Claim 10, wherein the transcriptional regulatory element has the 
nucleotide sequence of SEQ ID NO: 1 1 . 

10 

1 2. The method of Claim 1 , wherein the transgene comprises at least one matnx 
attachment region (MAR). 

13. The method of Claim 12, whereinthe transgene comprises a 5' MAR and a 3' MAR 
1 5 which flank said nucleotide sequence. 

14 The method of Claim 1, wherein me heterologous polypeptide is selected from the 
group consisting of a cytokine, a hormone, an enzyme, a structural polypeptide, and an 
iromuoglobulin polypeptide. 

20 . ri 

1 5 The method of Claim 14, wherein the cytokine is selected from the group consrstmg 
of interferon, interleukin, granulocyte colony-stimulating factor, granulocyte-macrophage 
colony-stimulating factor, stem cell factor, erythropoietin, thrombopoietin, and stem cell 
factor. 

25 

16. The method of Claim 15, whereinthe cytokine is an interferon. 

17. The method of Claim 1, wherein me transgene comprises an internal ribosome entry 
site (IRES). 

30 18. The method of Claim 17, whereinthe transgene comprises at leasttwo nucleotide 
sequences each encoding a heterologous polypeptide. 

19. The method of Claim 18, wherein the at least two nucleotide sequences encode at 
35 least two heterologous peptides that form a multimeric protein. 



-87- 



WO 03/024199 



PCT/US02/30156 



5 



20. The method of Claim 19, wherein the multimeric protein specifically binds a 
selected ligand. 

21 . The mefcod of Claim 20, rvherein the multimeric protein is an antibody. 

22 The memc4 of Ctaim 1. wherein the heterologous poiypeptide comprises a pepude 
region suitable for the isolation of the heterologous polypeptide. 

M . The method of Claim 1, wherein the nucleic acid is a euiaryouc viral vector. 

Z m oo consisting of avian !eukosis vires, adenovirus, hanrfemn-polylysme enhanced 

murine leukemia virus-derived vectors. 
15 25. ThememodofClaiml.wheremmenncleicacidisaplasrmdvecto, 

26. ^emefcodofClaiml,^ 
(BAC). 

20 

27. The method of Claim 1, wherein the nucleic acid is not a eukaryotic viral vector. 



28. The method of Claim 4, wherein the transcriptional regulatory element is a 
25 regulatable promoter. 

2, The memod of Claim 6, whereinthe selected nucleic acid further comprises a region 

SEQIDNO: 9. 

30 30. ThememodofClaiml,whereinmenucleotidesequenceencodmgsaid 
heterologous polypeptide comprises an origin of replication. 

31. ThememodofClaimSO^^ 
35 replication. 
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32 Tie method of Claim 1, wtarata<h= »**> «« «• »**<> fom «" f"? 
c^Mngofalinear nucleic ac,4 a pasnud. a viral nucleic acid, and anarbnctal 

chromosome. 

5 33. Tl.^rf<^al*^4•-^'*- wto " l, ^• 
centromere and optionally a telomere. 

34 The medtod of Claim 32, wherein ft. linear nucleic acid has at least one cohesive 
eadcharac.etized^mecohe.tveendgeneta^byatestfctionendonuclease. 



10 



35. ncmemodofCUim32,whe re mmelir^nt K leicaddhasatleastoneblnn.end. 

36. The method of Claim 34, wherein the at leas, one cohesive end is generated by 
chemical synthesis. 

15 37. ThememodofClaim34,whereinthcatleas.onecohesi,eendtsgenera tt dbyan 
enzyme other than a restriction endonuclease. 

38. The method of Claim 34, wherein the at least one cohesive end is generated by a 
20 combination of chemical and enzymatic methods. 

39 . The memod of Claim 1, wherein the nucleic acid is introduced to the avian male 
germ cells by restriction enzyme mediated integration 

25 40 n.^-C^39.to«^*'«--^» to ^r b 
g^oellsarastticnonondonnCease capable of cleaving me genomtonuolacactd of to 

avian male germ cells. 

41. TtemefcttlofClaim^ 

30 restriction endonuclease to the avian male germ cells. 

42. ttemethodofClaiml,^ 
cells by adeno-associated virus-derived vector. 



35 
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43 . The method of Claim 42, wherein the nucleic acid is bounded by inverted terminal 
repeat sequences. 

44. The method of Claim 42, wherein the nucleic acid is bounded by inverted terminal 
5 repeat sequences derived from an adeno-associated virus-derived vector. 

45. The method of Claim 42, wherein the adeno-associated virus-derived vector further 
comprises a transcription cassette capable of expressing an adeno-associated virus Rep 
protein. 

10 

46. The method of Claim 45, wherein the Rep protein is Rep 78. 

47. The method of Claim 45, wherein the nucleic acid bounded by inverted terminal 
repeat sequences is inserted in a first nucleic acid vector and the transcription cassette 

15 capable of expressing an adeno-associated virus Rep protein is inserted in a second nucleic 
acid vector. 

48. The method of Claim 1 , further comprising the step of irradiating the avian male 
germ cells, thereby cleaving the nuclic acid, wherein the radiation is selected from the group 

20 consisting ofultraviolellight, gamma rays, X-rays, and ultrasound, 

49. The method of Claim 1 , wherein the avian oocyte is an isolated oocyte, and wherein 
the avian male germ cells having the nucleic acid are delivered to the isolated oocyte by a 
method selected from the group consisting of microinjection, intracytoplasmic sperm 

25 injection (ICSI), and artificial insemination. 

50. The method of Claim 49, wherein the avian male germ cells having the nucleic acid 
therein are delivered to the nucleus of the oocyte. 

30 51. The method of Claim 1 , wherein the nucleic acid forms an episome in the avian 
male germ cells. 

52. The method of Claim 1 , wherein the nucleic acid in the avian oocyte is an episome. 

35 
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53. 



ThememodofClaiml^er comprising isolating ^ avian oocyte from the 



female of an avian by: fore and 

(a) removing an ovum from a bira alter ovuia 

(b) removing an albumen layer from the ovum. 

5 54. The method of Claim 1 , further comprising the steps of: 
fa* fistulating an avian female; 

10 avian female as a shelled egg; and 

(c) incubating the shelled egg until said shelled egg hatches, 
thereby producing a transgenic avian containing the transgene. 



Lone 



15 55. WmeAodofCUimS^wh^fcta^logouspolypepadeisexp^i,. 
or more cells of said transgenic avian . 

56 . ^ m eftodofCla i m55,»he I dn te he«erolo g ou S polypep« ei sexp reSS edin«he 
serum of said transgenic avian. 
20 57 . T i em.«hodofCla im 55,wh erci nd K he Kro logo US polypepddei S expr^ind 1 e 
magnum of said transgenic avian. 

58. Tl.n.eftodofCtoH^roon.pn^d.e^pofa.lowtagd.^gcnic 

25 avian to develop to sexual maturity. 

59 llem e ft odo f aa ta 58,v*ere i n4ehe,e K ,logous P olypepdd e i S d e Uv e ,=d,od 1C 

wbiteofadeveloptagavianeggpioducedtyttottansg^"™- 

30 60 Th eme todofC 1 aiB 1 55or59^compnsingUoladn gS aid tote otogou S 
poi^epdd.ta.aid^genicaviaaoraa.ggprcducedby.he^gemcav.aa 

35 using a eukaryotic viral vector. 
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62. A transgenic avian produced by the method of Claim 54. 

63. The transgenic avian of Claim 61 or 62, wherein the avian is a chicken. 

5 64 The transgenic avian of Claim 63, wherein the heterologous polypeptide is selected 
from the group consisting of a cytokine, a hormone, an enzyme, a structural protein, and an 
immunglobulin polypeptide. 



10 



65. The transgenic avian of Claim 63, wherein the cytokine is an interferon. 

66. The transgenic avian of Claim 61 or 62, wherein the transgenic avian produces a 
heterologous multimeric protein. 

67. The transgenic avian of Claim 66, wherein the heterologous multimeric protein 
1 5 specifically binds a selected ligand. 

68. The transgenic avian of Claim 66, wherein the heterologous multimeric protein is an 
antibody. 

20 69. An avian egg produced by the transgenic avian of Claim 61 or 62. 

70. An avian egg produced by the transgenic avian of any of Claims 63-68. 

71. A heterologous protein heterologous protein produced by the transgenic avian of 
25 Claim 61 or 62, wherein the heterologous protein comprises a heterologous polypeptide 

selected from the group consisting of a cytokine, a hormone, an enzyme, a structural 
protein, and an immunoglobulin polypeptide. 

72. The heterologous polypeptide of Claim 71, wherein the cytokine is an interferon. 

30 

73. The heterologous protein of Claim 71, wherein the heterologous protein is a 
multimeric protein. 

74. The heterologous protein of Claim 71, wherein the heterologous protein is an 
35 antibody. 
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SEQ ID NO: 6 



.^-.Tf-rr TTCTTGCCGA XGAAAGGAXA 60 
TGCCGCCTTC TTTGATAXTC ACXCXGXXGX M-CATCTC "CT^ 

TAACAGXCXG XAXAACAGXC TGTGAGGAAA TAC.TGGTAT ^ ACAAAGCC CA 180 

ATAAGTAATG TTGAATAXXG ^SgS'S^SSgG AGAGGXXXXX TTGCCTGTTT 240 
CAGCAGGTGG XGGXXGGGGX GGTCGCMCT JAG-GAC xXAGXAAVTX TTCTACTGGA 300 
TTTTTTTTTT TTTTTTTTTT ^AAGGTG ^CxTTT GAACCTTTTG GAAACTGTAC 360 
CTGTATGTTT TGACAGGTCA «J*2™S SgTGcSaT GCCTTTGGTT CXGAXXGCAX 420 
AGCCCXXXXC TTTCATTCCC TTTTTGCTTT CTGTGCCAA QXGXGGCXXG AAAGCTTGGA 480 
TATGGAAAAC GXXGAXCGGA ACTTGAGGTT JTiATTTAXA TTATTTTTTC 540 

TAGCTGTTGT XACACGAGAX ACCTXiUXAA gx.AGGCCA ^ q^ttAGATTT 600 

CCTTTGAAGT AGXGAGCGTT CXCTGGXXIX JTTCCTT^ TQ TAAATGTTTT 660 

TTCTAATGGG ATTTTTTACC TGATGATCTA ^AXACC ^ Q ATCTGTGTTT 720 
CCTAGTTAAC ATGXXGAXAA CTTOGGATTT ACAiGTTGI CCTT TTTTT TTATC 780 

CTAGTAAAAA TATATGGCAT TTATAGAAAT ACGXAAXXCC X ^ ATAQAATTTT 8 40 
TCTATGCTCT GXGXGXACAG GTCAAACAGA ^CXCC^ ^ TCCTAGAGCG 900 

ATATGCAGTC XGXCGXIGGX ^XXGXGXXG JAAGGATACA ^ TTTGGCTGCT 960 

ATGCTCAGTA AGGCGGGTTG TCACATGGGT ^AAATGTAA CGCTTCAGAT 1020 

GCCTTCCCGA GATCCAGGAC ACTAAACTGC gC^CACTG CT TTCTAAAATA 108 0 

CCCAGGGAAG TGCAGATCCA CGXGCAXAXX CTxAAAGAAO TTGGTAACGG 1140 

TTTTGGCATA GGAAGCAAGC TGCATGGATT TG.xTGGGAC ^ TATGCAGAAG 120 0 

AGTGCATAGG, TTTTAAACAC AGTTGCAGCA TGCTAACGAG CATTGCAGAT 1260 

TGATGCCTGG ATGCCTGTTG «GCTOTT» ^ACTGCC XX Q TCCCGGAACA 132 0 
AGGGGTGGGG TGCTTTGTGT CGTGTTCCCA CACGCXGCCA TTACTATGAA 1380 

CATCTCACCT GCXGGGXACX TXXCAAACCA TC-iAGCAGi GT XGGGCAAA 1440 

ACAGAGAAGX TCCTCAGTTG ^ATTCTCA ^oAXGXCX ^ TCCTTTCTAT 1500 

GXAXGAXAAA GCAXCXCXAT ^GTAAAXTA ^ACXXGXT ^ CTTCAATCTT 15 60 

AGCACCACIX AXXGCAGCAG GTGXAGGCXC TGCTGTGfaU. GTAAACAGTA 1620 

XXAAAGCXXC XXTGGAAAXA CACTGACXXG ^xGAAGXCX ■ GCCTATTCAT 1680 

CXXACCXXXG ATCCCAAXGA AAXCGAGCAX TxCAGXXGXA ^ CAAGTAATAG 174 0 

ACCAXGXAAX GTAATTXXAC ACCCCCAGXG ^CACACX TMTTX aaacTGTGCA 1800 

ACXTTGGCCX CACCCXCXXG TGXACXGXAX TTxGTAATA AAAAXGAGGA 1860 

TAXGATXAXX ACAXXAXGAA AGAGACAXXC TGCXGAXCXX ^ „ AAAAAAAA 192 0 

GXGCGXGXGC X7TXAXAAAT ACAAGXGAXX ^AAAXX^ tqaAAXACAX XCCXAXXXGG I960 
AAAAAAAAAG lAAXAXAAAA AGGACCAGGT ^TTALftA c ATAAGGC TGX 2040 

XAAACAGTTA CATXTXXAXG AAGAXXACCA GCGCXGCXGA CTT ^ „ 0 

AXXGTCTXCC XGXACCAXXG CAXXXCCXCA TTCCCAAXXX ^ GAATGCAGAG 2160 

ACXAXXCAAG AAATGGCTTT GAAATACAGC gCGGAGCXX c ^qgaXTTX 2220 

XXGCACXGCA AAATGTCAGG AAAXGGAXGX CxCXCAGAAX G CXGAAXGXX 2280 

AXAXGXGXAT AXAGXAAGCA GTTTCCXGAT TCCAGCAGGC CAR ACXGCGGAXX 2340 

GXGXXGCCGG AGACCXGXAX TXCXCAACAA JGiAAGAXGG GXXXCATCAX 2400 

XTAATACAXX IXCAGCAGAA GXACTXAGTT AAxCXCXACC ^ CCXXTXTTTC 2460 

TTXTAGATGT XAXACXXGAA AXACXGCAXA ^XXXXAGCX TC TTAAACTGCA 2520 

AGCCXXXAGG AGACXGXXAA JCAAXXXGCX GTCCAACXXX ^ ^acXTTTG 2580 

AXAGXAGXTX ACCXXGXATX JAAGAAATAA ^ACCAXX cAXXAXGXCA GXXCXGXCAG 2640 
SS SSSSS SSSSS CAGTGGAGXX ACAGCXGCGG XXXXGAXGCX 2700 
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^T-vrrr CTGAAACTAG AAATGATGTT GTCTTCATCT J^^^ GTCAACTTTT 2820 

SStgX gctagtgaga aatgcataca tttattgata cxtttttaaa cttctg ^ tc 

tScAGATTT TTTTTTCATT TGGAAATATA TTGTxTTCTA aCT TAAAC TTCAT 2940 

tc-aStgcag tctgattggc atgaagaagc acagcactct t ca GTGCGCTC ag 3000 

S'tSStGA AGGAAGTTAA GCAAGGGCAC AgCCATGA GAAAGTAACA 3060 

SSXStoa acctggattt ctttggctag tgttctaaat c tgg obkmot 3120 

JccStTCCT TGAAAGGGCT CCAGCTTTAA ^GCTTCCAAA tqgc TTCXCCA CTA 3180 

S-CACTGGT TATTTACTGC ATTATGTCTC AGTTTCGCAG q GGTGGAAG GA 3240 

SgAGcItGG ACTATAGCCT GGCTTCAGAG GCCAGGTGAA G ^ GTGGGCAGC ^ 3300 

SgCTGGGCT GTGGCTGGGG GGACTGTGGG GACTCCAAGC tc ATCTGC AAAT 3360 

SSaAAAG TGTGGGTAAC TATTTTTAAG TACTGTGTTG ^ GGATGAATTC 3420 

£SE SSSSS S5S5 ESS — ? «X SS 

Sis sss ss iii ssss sss us 
Sss sss sss ~ isss ss ;s 
is ssss sss sgu - = ss sr. 

Ss SSS SEX |= ~ sss 32 

sss ssss ssss Eii sss sssi :s 

SgIgATTTA GACACAAGGG AAGCCTGAAA GGAGGTGTTG ca ATTTAAAA TA 4200 

ACCCTGTACT TCAAATATAT ATTTTGTGAG .GGAGTGTAGL GAGATCTTCT 4260 

AGATTGAAGG CTGAGTAGTT GAGAG^AA . t gagtq€TCTT 4320 

GAAACTACTG CTTCTAAACA CTTf f GAG TGG^AGACC atccatgcm TCC ACATCCA 43 
GTTACATGTC TGATGCACTT GCiTGTCCTT T R cCTGTCTCTT CGTCACTTGG 44 0 
CGCATTTGTC ACTTATCCCA TATCTGTCA1 AJ£ TTGAGAAGAT GGCAGTTGCT 4500 

TcSgAAA CAGATGTGAT ^ TC ^^-S c xcCTGGC TTTGACACCT CACGAAATAG 4560 
TCTTTCCCTT TTTCCTGCTA AGTAAGGATT JTCTCCTG GCQC -jGCTCTCAAG 4620 

ss sss: sss sss. — « « 

SS SSSS gc|S =|c c ggg- s 

TTf-CAACTG^ TGGTGGAACT GGiGCTTAAA Q CCAC CCCCAC TGCAGGCTTA 43ju 

JicOTcSG GCAGTCAGTT TATTTCTGAC AGACAAACAG CCA^ q 4980 
GAaStSgT GGCTCTGCCT ^GTGTGTTA CAGC^CTGCC caagcTGTAA GGAACTTGGG 040 

SSS SSSS SSS =Tc . jjjcggr «««« 60 

AGTCTGGACT CTGCAGCATG TAGGTOGGCA GCT AG TGTTGCCG 5220 

CACTGATGGA GGAGTAGTAA ^TGGAGAC CGATTCAGAA AATACATAAA 
AAGAAACTGA TGGAAATAAT GCATGAATTG T ^ cagcCATAAA ACCAGGTGAG 53 

CTACTTCAAA TGAGGTCGGA GAAGGTCAGT T CTGCG TAAGT ATAAGTTCTC 

CGAGTACCAT TTTTCTCTAC AAGAAAAACG A 
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ca.agcggc, gaagccccc cc = c xgcc^tca G C = C 5460 

CCTTGGGGTT TCTCTCACAG CAGTAATGGG ^ACTT TA CCCCCTGA CTGTTCCATT 5580 
TGTCATGTGG GATCCCTACT GTGCCCTCCT GGTTTTACGT ATCACCTCCT 5640 

CAGCGGTTTG GAAAGAGAAA AAGAATTTGG AAATAAAACA GAGAGGGGGT "5700 

CCAGCATTTT GGTTTTTAAT TATGTCAATA ACTGGCTTAG £ ^ TATTTAGAGA 5760 

TGGGTGTATT ACCGAGGAAC AAAGGAAGGC TTATATAAA^ ^ AGCCGCXAXA 5820 

ACTGGCAAGC TGTCAAAAAC ^AAGGCCT TACCACCAA. ^ CCTGCTXGXG 5 880 

GCCAGCAGGG CCAC-CACGAG GGATGGTGCA ^U£U TGTTTCAGAA 5940 

ACTCTGAGAG CAACTGCTTT GGAAATGACA GCACTTGGTG CAATT^ 6QQQ 
TGCGTAGAGC GTGTGCTTGG CGACAGTTTT ^AGXTAGO ^ CGTTCAAAC A 6060 

TCCTCATTCT CCTAAGCATG TCTCCATGCT ^AATCCCA ^ XCTAXAAAAX 612 0 

ATGAATCCAT CACTGTAGGA TTCTCOTGOT GATCAAAiCT ATTCACATCC 6180 

ATGGAAGCTT ATTTATTTTT CGTTCTTCCA JATCAGTCTT TTC TAGCTXXACG 624 0 

ACCACAGCAA ATTAAAGGTG AAGGAGGCTG ^GGATGAA c TGGGGTAAGA 6300 

TTCTTCCTTG CAAGGCCACA GGAAAATGCT ^AGCTGTA A GACTCATCTT 6360 

AGTTC A GTCT CCTGCTGGGA CAGCTAACCG CATCTTATAA 6420 
AGGACCAAAT AGGGTCTATC TGGGGTTTTT GTTC^ GCCTGAATTT TTTCTAGGCC 6480 
CACTATTTCA CTGCTCCCAC ^TACAAAC CAAAGATACA TT TCCTTCCC CA 6540 

ACATTACATA AATTTGACCT GGTACCAATA ^TTCTCTA TTGGATTGGA 6600 

CTGTGTTTAA CCCCTTAAGG GATTCAGAAC AACTAGAATC QC 6 660 

AGGGGCCTTA AACATCATCC ATTTCCAACC CTCTGCCAT A TGGGGCACCC 6720 

CTCAGGCTGC CCAGGGCCCC ATCCAGCCTG gCTTGAGCA ^ GAATTCTCTT 6780 

ACAGCTTCTC TGGGCAGCCT GTGCCAACAC CTCACCAQTC TT TTTCCCGTTG 6840 

TTAA"CATCTA ATCTAAATCT CTTCTCTTTT AGTTTAAAGC A GTACTGGAA 6900 

CTATCTGTCC AAGAAATGTG TATTGGTCTC CCTCCTGCTT QC CCAGCTCC TT 6960 

GGCTGCAGTG AGGTCTCCCC ACAGCCTTCT CTTCTCCAGG CC .AACAGTTC 7020 

CAGCCTGTCT TCGTAGGAGA TCATCTTAGT ^CCTCCTC ^ CCTTACAAAG 7080 

CACGGCTTTC TTGTGGAGCC CCAGGTCTGG ATGCAGTACT TCAG TTTGATGCAG 7140 

GCAGAGCAGA TGGGGACAAT CGCTTACCCC JCCCTGCTGG ^ TCAAGCTTTT 7200 

CCCAGGGTAC TGTTGGCCTT TCAGGCTCCC AGACCCCTTG CTG ^ XAAGCTTGTT 7260 

CATCCACCAG AACCCACGCT TCCTGGTIAA TACTTCTGC CCT ATTCXXGCAX 7320 

TCAGGAGACT TCCATTCTTT AGGACAGACT ^TiACAC CTAACAAAAA 7380 

ATATACATTT CAGTTCATGT TTCCTGTAAC AGGACAGAAT A ^ AGCACAXAGT 744 0 

TACATGCAGA ATTCCTAGTG CCATCTCAGT AGGGTTTTC M TTGGGATCAG 7500 

CAATTTGCTG CAAGTACCTT CCAAGCTGCG ^CTCCCA QA ^ GCr£C AG 7560 

TTACCTTTTG .GGGTAAGCTT TTOTMCTGC AGAGACCCTG Q XACAACXXCC 7620 

CTCTGCTCTG TTCTGACTGC ACCATTTTCT AGATCACCCA TTTGTG TTTG 7680 

TTGTCCTCCA TCCTTTCCCA GCTTGTATCT TTGACAAATA TGGAACAGGA 7740 

CTTCAGCAGC CATTTAATTC TTCAGTGTCA TCTTGTTCTG TTG A TATTCTTAC 7800 

TTTTCAGCAG TCTTGCAAAG AACATCTAGC TGAAAACTT fTCGTCACTG ACAAGTTTAT 7860 
CAGTTCTTCT TGTTTGAGGT GAGCCATAAA TTACTAGAAC ^ ACATAXXXTG 7920 

GCATTTTATT ACTTCTATTA TGTACTTACT JTGACA1AA GXCAXA CTTCCGTTAT 7980 

CTGGGATTTC CACAGTGTC. CTGTGXCCTT CA^ATGGTTT TAC^^ ^ 
AACCTTGGCA ATCTGCCCAG CTGCCUAi^ ^ 
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TCTTC^GCCA AXAAACAAAA XGXGAGAAGC CCAAACAAGA ACTTGTGGGG CAGGCXGCCA 8100 

tcSSgaga SS gggxxgxgxa gctcaataga axxaag^at ™cxg 

^ fT>OAr ,, r , ar , TTTTrrrTGA TTTATACAGG CACGCCCCAA GCCAGAG.n^ CTGTCTGCCA ti^u 
SSSot gSSSS Snia TAAGTCATAG GTAACTTTTC TGGTGAATTG 8280 

"EE ='= sss ssss ssss 

EE ess ss SSK S5SSSS SSSSK 

T^CTCATCXX CXXCACAXCA TCAAACCTTT GGCCTGACIG ATGCCTCCCG 8580 

S~ SSS ^CXXTA XXXXXGTAXG AXTXGAAGXC AGAACCXCCG 8640 

GATCAGGAGG GAACACATAG TGGGAATGTA CCCXCAGCXC CAAGGCC^ TCTTCCTTCA 8700 

ATGATCATGC ATGCTACTTA GGAAGGXGXG XGXGXGTGAA TGTAGAA^G ^XXGXXAX 8760 

TTTTTCTTCC TGCTGTCAGG AACATTTTGA ATACCAGAGA AAAAGAA-AG XGCXCXXCXX 8820 

SS GAGXXGXCAC ACXXGCAAAA TAAAGGATGC AGTCCCAAAT GTTCATAATC 8880 
?CA^ScTC SggIgGAXC AGAAACTGTG TATACAATXX CAGGCTTCTC TGAATGCAGC 8940 

iSSSS? tcScSggc cgaggcagta ctagtcagaa CCCXCGC-AAA caggaacaaa 9000 

" gSSgCAGG AGGAAACACC TTGCCCATCA TGAAAGTC-AA TAACCACTGC 9060 
SSSSSa ATCCAGCTCC TGTTTGAGCA GGTGCTGCAC ACTCCCACAC TGAAACAACA 9120 
SSSSSS AXaSSxC CAGGAAGGAT CXXCXTCXXA AGCTTCTTAA XXAXGGXACA 9180 
SSSS GCAGATGACT ATGACTACTG ACAGGAGAAT GAGGAACTAG CTGGGAATAT 9240 
SctlxGGA GTCACCCATT TCTTTACTGG TATTTGGAAA TAATAATTCT 9300 
iSSSS gSgGAGTTA GCGAAGATCT TCATTTCTTC CATGXXG3TG ACAGCACAGT 9360 
StScS£ SSSaCT XACAAGGAAG AGGATAAAAA TCATAGGGAT ^AAAXCXA 

, GA rAATGAGGTT TTAGCTGCAT TTGACATGAA GAAATTC-.-.^-A CCXCXACXGu S-BU 

£££££ SSSgtg tctttttgct tagttactta ! 

rTRTGnacTC AGGTCTCTCG GGCTACXGGC ATGGATTGAT TACAXAG-^As. -TGX-AAXTTXA 9600 
fSm AGGgSxAXG AGXACXXXXG CAGXAAAXCA XAGGGXTAGX AAXGXXAAXC 9660 . 
SSgSSaA 5SSSwS CCAACCCXGA CAGACAXCCC AGCXCAGGTG GAAAXCAAGG 9720 
SSSSSiS JSSSSJSc CAGAGAACAC AGGGACTCXX CXCXXAGGAC CTXXAXGXAC 9780 
:™cS S™AX GXXAGXCAGA AGACXXXCCA XXCXGGCCAC AGXXCAGCXG 
, rrr , aj , TrrT GGJUVXXXXCX CXCCGCXGCA CAGXXCCAGX CATCCO-._-ii XGTACAC^liU 

SSSSSS CGXGAXCCAA GGAGCAGAAG XXCCAGCTAX GGXCAGGGAG 9960 
SSSSg SccXacxca CXGCACXCAA ACAAAGGCGA AACCACAAGA GXGGCXXXXG 10020 
iGCCXGACCG XCCCAAt-i^A ^ CACCAGTACX GGAXTGACCA CGAGGCAACA 10080 

5SSSSS SS " SSSrtS CTAACTGATA CIACMTGCA 10140 

™ KSS 5ESSSS = =gc SSEg 

SB SEES SEEK = =| ~ 
SEE SS gSj£g SSfe ^ i-s 
SS5 JSSS SSSSSS |= =| »~ »«• 
SSSSSS SS SS SS SSSSSS ££~c w « 
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CTCTCTGGGC AGCCTGTGCC AGCACCTCAC CACCCTCTCT ^TGAAGAACT JTTCCCTGAC. 10800 
ATCCAATCTA AGCCTTCCCT ^AGGTT ^TCCACTC CCOC^ TcTCCTTGCA 20 
TACTCTTGTA AAAAGTTGAT TCTCOTCCTT CAGCCTGTCT TTATAGGAGA 10980 

GCCTTCTTCT CTTCTGCAGG ^AACAAGC CCAGCiCCCT ^ ^ GKGCTCCPl 110 40 

GGTGCTCCAG CCCTCTGATC ATCTTTGTGG CCCiCCiC ^ qqcCTCAAAA 11100 

CATCTTTCCT GTACTGGGGG CCCCAGGCCT ^GCAGTA CT CTTCTGATGG 11160 

GAGCAGAGTA AAGAGGGACA ATCACCTTCC ^CCiGCl CACTATTAAA 11220 

AGCCCTGGAT ACAACTGGCT TTCTGAGCTG CJJCHCrcC ^ TCTTCATTTC 11280 

ACAGGAACAA TACAACAGGT GCTGATGGCC «^ GTGTGCTTCT TCCTCCTCAA 11340 
GGTAGATCTT AGATGAGGAA CGTTGAAGTT ^CCTTCTGC ^ CCCCTGCAGC 11400 

ATACTCCTGC CTGATACCTC ACCCCACCTG CCA.CTG ATG TATGACCAAG 11460 

CAGGGCCCTG ATGAACCCGG CACTGCTTCA ^C-CTGTTT 11520 
TTGCACCTAT GAATACACAA ACAATCTGTT °^TlCA TTATCTACCA 11580 

AATTTGCATT GTCAGGAAAT GGTTTAGTAA T^iGCCAAT J TATGGAGTAA 11640 

TGGCTGTTTT TATGGCTGTT AGTAGTGGTA ^GATGAT ^ GCCAATGXGG H700 

AATCAAGACT GTAGATATTG CAACAGACTA T^AATT^ ^^g^ GTG TTGGGAA 11760 
TACTTCCCAC ATTGTATAAG ^TTGGCA AG1TTACAGC > ^ CAAAAGGGGG 11820 
ATTTCTGTAT ACTCAAGAGG GCGTTTTTGA ^TG GCA GTCCCGC TGTGTGTACG 11880 
TGGGAGGAAG TTAAAAGAAG JGGCAGGTGC CTTCCTGCC C CTGGCTGCCT 11940 

ACACTGGCAA- CATGAGGTCT TTGCTAATCT ^™ GAGGACCCTG ATGCTGCTGG 12000 
TAGGGTGCGA TCTGCCTCAG ACCCACAGCC TGGCCAGCAG T TTTGGCXXXC 12060 

CTCAGATGAG GAGAATCAGC CTGTTTAGCT JOCiGMflCA CTGCACGAGA 12120 

CTCAAGAGGA GTTTGGCAAC ^^^^^^ CAGCGCTGCT TGGSATGAGA 12180 
TGATCCAGCA GATCTTTAAC JTGTTTAGCA GAACGATCTG GAGGCTTGCG 12240 

CCCTGCTGGA TAAGTTTTAC JCCGAGCTGT GGAGGATAGC ATCCTGGCTG 12300 

TGATCCAGGG CGTGGGCGTG ^CGAGACCC ^GATGA. ^ AGCCCCXGCG 12360 

TGAGGAAGTA CTTTCAGAGG ATCACCCTGT ACCiG - CCTGAGCACC AACCTGCAAG 12420 
CTTGGGAAGT CGTGAGGGCT JAGATCATGA M^JJjg C GGCCGGCCG CTTCGAGCAG 12480 
AGAGCTTGAG GTCTAAGGAG TAAAAAGTCT AC^iCGGG Q XGAAAAAA A X 12540 

ACATGATAAG ATACATTGAT GAGTTTGGAC ^CAAC ^ AGCTGCAAXA 12 «,0 

GCTTTATTTG TGAAATTTGT GATGCTATTG ggttcAGGGG GAGGTGTGGG 12660 

AACAAGTTAA CAACAACAAT ^CATT GATCCGXCGA 12720 

AGGTTTTTTA AAGCAAGTAA AACCTCTALA an. 
GCGGCCGC 12728 
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SEQ ID NO: 5 

ICCCMCT = CTC^CCC, CggMC ~ CCC = g 
ATGAGGAGAA TCAGCCTGTT TAGCTGCCTG ... cTGTGCTGCA CGAGATGATC 180 
GAGGAGTTTG GCAACCAGTT £AG£AGGCT ^ACCATCC BB^ TGAGACCCTG 
CAGCAGATCT TTAACCTGTT TAGCACCAAG ^TAGC ATCTGGAGGC TTGCGTGATG 

CTGGATAAGT TTTACACCGA GCTGTACCAG C-kGCTGAA ^ AGCh ^ GGCTGTGAGG 
CAGGGCGTGG GCGTGACCGA GACCCCTC.G ^GAA CTGCGCTTGG 
GAAGTCGTGA SS CATGAGGAGC TTTAGCCTGA GCACCAACCT GCAAGAGAGC 
TTGAGGTCTA AGGAGTAA 498 



240 
300 
360 
420 
480 
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SEQIDNO:7 



r-.^^nvn ttpttGCCGA. TGAAAGGATA 60 
TGCCGCCTTC TTTGATATTC ACTCTGTTGT ATTTCATC.C TXCTjWa ^ 
TAACAGTCTG TATAACAGTC TGTGAGGAAA TAC xCGTA^ q>j>(2TTGGGAG ACAAAGCCCA -180 
ATAAGTAATG XTGAATATTG GATAAGGCTG TG1C1CCT AGAGGTTTTT TTGCCTGTTT. 240 

cagcaggtgg tggttggggt ggtggcagct cagtgacag . ttctactgga 300 

TTTTTTTTTT TTTTTTTTTT MOWKK ^^AA GAACCTTTTG GAAACTGTAC 360 
CTGTATGTTT TGACAGGTCA GAAACATTTC JTCAAAA GCCTTTGGTT CTGATTGCAT 420 
AGCCCTTTTC TTTCATTCCC TTTTTGCTTT CTGiCCCA TTG AAAG CTTGGA 480 

TATGGAAAAC GTTGATCGGA ACTTGAGGTT GCTTGATGCT TTATTTTTTC 540 

TAGCTGTTGT TACACGAGAT ACCTTATTAA ^TT ^ctggtGAG GCTTAGATTT 600 

CCTTTGAAGT AGTGAGCGTT CTCTGGTTTT "TCCiX^ CAAATGCTTG TAAATGTTTT 660 
TTCTAATGGG ATTTTTTACC TGATGATCTA GTTGCAxACC QTC ATCTGTGT TT 720 

CCTAGTTAAC ATGTTGATAA CTTCGGATTT ^CATCTTG TTTTTTTATC 780 

CTAGTAAAAA TATATGGCAT TTATAGAAAT JCGTAATTCC * ATAGAATTTT 840 

TCTATGCTCT GTGTGTACAG GTCAAACAGA CTTCACTCC GCCTTAAATT TCCTAGAGCG 900 
ATATGCAGTC TGTCGTTGGT ^TTGTGTTG TAACCAxAC ACG TTTGGC TGCT 960 

ATGCTCAGTA AGGCGGGTTG TCACATGGGT TCAAATGTA. CGCTTCAGAT 1020 

GCCTTCCCGA GATCCAGGAC ACTAAACTGC TTCTGCACT TTCTAAAATA 1080 

CCCAGGGAAG TGCAGATCCA CGTGCATATT CTTAAAGAAG TTGGTAACGG 1140 

TTTTGGCATA GGAAGCAAGC TGCATGGATT TGTxTGGGAC ^ TATGCAGAAG 1200 

AGTGCATAGG TTTTAAACAC AfTTGCAGCA JGC^ACGAG T Q CATTGCAGAT 1260 

TGATGCCTGG ATGCCTGTTG CAGCTGTgA CGGC ClGCC ^ TCCCGGAACA 13 20 

AGGGGTGGGG TGCTTTGTGT CGiGiTCCCA gCGCi AG TAGATGAG TTACTATGAA 1380 

CATCTCACCT GCTGGGTACT ^TCAAACCA TCTT A^AGT T GTTGGGC AAA 1440 

ACAGAGAAGT TCCTCAGTTG GATATTCTCA TGGGA uiCT TCCTTTCTAT 1500 

GTATGATAAA GCATCTCTAT ^GTAAATTA JGCA^TGTT * CTTCAATCTT 1560 

AGCACCACTT ATTGCAGCAG GTGTAGGCTC TGG.GTGGCC T A GTAAACAGTA 1620 

TTAAAGCTTC TTTGGAAATA CACTGACTTG ^TGAAGTCT cc GCCTAXTCA T 1680 

CTTACCTTTG ATCCCAATGA AATCGAGCAT TTCA. TGTA ATT ^TAATAG 1740 

ACCATGTAAT GTAATTTTAC ACCCCCAGTG CTGACACTi aaaa taTTTT AAACTGTGCA 1800 
ACTTTGGCCT CACCCTCTTG TGTACTGTAT TTTGr . ATA ^ ^GAGGA 1860 

TATGATTATT ACATTATGAA AGAGACATTC ^JTCA GCAGGTGTCC TTAAAAAAAA 1920 

GTGCGTGTGC TTTTATAAAT ACAAGTGATT GCAAA TAGT T . TCCTATTTGG 1980 

AAAA^AAAAG TAATATAA.AA AGGACCAGGT GTTTTACA. • ATAAGGCTGT 2040 

TAAACAGTTA CATTTTTATG AAGATTACCA ^GCiGCTGA AT GTCTGGG TAA 2100 

ATTGTCTTCC TGTACCATTG CATTTCCTCA TTCCCAATTT TQ GAATGCAGAG 2160 

ACTAITCAAG AA^TGGCTTT GAAATACAGC ^GGAGCTT cc ^ggattTT 2220 

TTGCACTGCA AAATGTCAGG AAATGGATGT CTCTCAGAAl GCTGAATGTT 2280 

SaStGTAT ATAGTAAGCA GTTTCCTGAT ^CAGCAGGC ^ T AGCA ACTGCGGATT 2340 
GTGTTGCCGG AGACCTGTAT TTCTCAACAA ^AAGATGG * GTTTCATCAT 2400 

TTAATACATT TTCAGCAGAA GTACTTAGTT ^TC^ACC £™ CCXTTTTTT C 2460 

TTTTAGATGT TATACTTGAA ATACTGCATA JCTTTxAGCi T <, TTAAACT GCA 2520 

5KS ESSES SSSSS SgS. s= K 

SSS ESSE SS S«»« TTTTGATGCT .00 
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„ ^rrrirrTT GTCTTCATCT GCTCATCAAA CACTTCATGC 2760 

GTTATTATTT CTGAAACTAG AAATGATGTT ^TTCAl ^ GTCAACTTTT- 2820 

AGAGTGTAAG GCTAGTGAGA AJTGCATACA ^AxTGATA C ^ CTTCTGAATC 288 0 

TATCAGATTT TTTTTTCATT ^AA™£ ^TTCTA ^ •jAAACTTCAT 2940 

TGAAATGCAG TCTGATTGGC ATGAAGAAGC JCAGCACTC ^ GTGCGCXC AG 3000 

TTTCOAATGA AGGAAGTTAA GCAAGGGCAC AGGTCCATOA GAAAGTAACA 3060 
GAGAAAGTGA ACCTGGATTT CTTTGQCTJG ^StcSS TTGAAGGTGG CAGGCAACTT 3120 

CCCGATTCCT TGAAAGGGCT CCAGCTTTAA TCCTTCCAA ^ TTCTCCACTA 3 180 

GGCCACTGGT TATTTACTGC AJTATGTCTC ^TTiCGC Q GGTGGAAG GA 3240 

TTGAGCATGG ACTATAGCCT ^TTCAGAG JCCAGGTGAA ^ GTGGGCAGCA 3300 

GTGCTGGGCT GTGGCTGGGG GSJCTOTGGG JACTCCAAG c ATCTGCAAAT 3360 

CAGGGAAAAG TGTGGGTAAC TATTTTTAAG ^CTGTGTTG GGATGAATTC 3420 

ACGTAGGGTG TGTACTCTCG AAGATTAACA ^GTGGGTTC ^ CTATGATTGG 3480 

ACAGTGGAAG CATTCAAGGG ™GATCATCT ^GACACCA f TCCACGTAAA 3 540 

AAGCGGTATC AGAAGAGCGA GGAAGGTAAG CAGTCTTCA1 GAGCATGTGC 3600 

GCAGTCTGGG AAAGTAGCAC CCCTTGAGCA ^CAAGGA TTCTGGTGCC 3660 

TAGGAGAACT TTCTTGCTGA AJTCTACTTG CAAGAGCTTT G ^ TTCTGCTCAA 3720 

TTCTGCAGCA CCTGCAAGGC CCAGAGCC iG AGCAGAGTGG 3780 

GTCCAAGCTT CAGCAGGTCA TTGTCTTTGC TTCTTCCCCC *f QQ TCTCAGAAAA 3 840 
AACTGATGTC GAAGCCTCCT GTCCACTACC TGTTGCTGCA ^^^^gjuHOCi* 390 0 

AGAGAGCTAA CTCTATGCCA TAGTCTGAAG GTAAAATGGG i T ^ggtgcag 3960 

AGGCAAAACC GGCTGCCCCA ^GAGAAGAAA GCAGTGGTAA ACAT^^ ^ 

AAGCCCCCAG GCAGTGTGAC AGGCCCCTCC ^CACCTAG GCGTTTGGTT 4080 

GCCTAGGGCT GTGCCCGCGA AGTGCGTGTT ^TTGGTGG ^ TGGTTTGTAA 4140 

TTGAGATTTA GACACAAGGG AAGCCTGAAA ^GGTGTTG AT TTAAAATA 4200 

AGCCTGTACT TCAAATATAT ATTTTGTGAG GGAGTGTAGC ^ GAGATCTTCT 4260 

AAGTTGCAAG AGATTGAAGG CTGAGTAGTT GAGAGGGTAA «OGT GAGTGCTCTT 4320 

GAAACTACTG CTTCTAAACA CTTGTTTGAG TGGTGAGACC T TC CACATCCA 4380 

GTTACATGTC TGATGCACTT GCTTGTCCTT "CCMCttC A ? CGTCACXTGG 4 440 

CGCATTTGTC ACTTATCCCA TATCTGTCAT AiCTGACATA GGCAGTTGCT 4500 

TCAGAAGAAA CAGATGTGAT AATCCCCAGC ^CCCCAAGT * CACGAAATAG 4560 

TCTTTCCCTT TTTCCTGCTA AGTAAGGATT gJTCCWWC £ TGCTCTCAAG 4620 

TCTTCCTGCC TTACATTCTG ^ATTATTT CAAA^ATCTT 4680 

TTTGTGTCTT CCTACTCTTA GAGTGAATGC ^GAGT GGXGGGTTTC 4740 
GTTGGCCGCA GTTCTCTGAT GAACACACCT CTGAATAa TCTGCGGAGT TGCAGTTATT 4800 

TCTGAGGAAC GGGCAGCGTT TGCCTCTGAA JGCAAGGAGC J GCTACTTCTT 4860 

TTGCAACTGA TGGTGGAACT GGTGCTTAAA GCAGATTCCC XGCAGGCTTA 4920 

TTCCTTCTTG GCAGTCAGTT TATTTCTGAC AGACAAACA^ GGGATTAAAA 4980 

GAAAGTATGT GGCTCTGCCT GGGTGTGTTA ^CTCTGCC ^ ^ ^cttgGG 5040 

CGGGCACCAT TCATCCCAAA CAGGATCCTC ATTCATGGAT CTCGCATTCC 5100 

CTCCMOCTC AAAACATTAA ^GGAGTACG ^AATXA AAA ^ TCTCAAAGAC ^0 

TAAGTCATTT AGTCTGGACT CTGCAGCATG TAGGTCGGCA ^ Q AGTGTrGCC G 5220 

CACTGATGGA GGAGTAGTAA AAATGGAGAC CGATTCAGAA aatACATAAA 5280 

AAGAAACTGA TGGAAATAAT GCATGAATTG ^GGTGGAC AT ^ ACCAGGTGAG 5340 

CTACTTCAAA TGAGGTCGGA GAAGGTCAGT JTTTTATTAG ATAAGTTCT C 5400 
CGAGTACCAT TTTTCTCTAC AAGAAAAACG ATTCTGAGU 
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CATAGCGGCT GAAGCTCCCC CCTGGCTGCC TGCCATCTCA GCTGGAGTGC AGTGCCATTT 54 60 
CCTTGGGGTT TCTCTCACAG CAGTAATGGG AC AATACTT C ACAAAAATTC TTTCTTTTCC 5520 
TGTCATGTGG GATCCCTACT GTGCCCTCCT GGTTTTACGT TACCCCCTGA CTGTTCCATT 5580 
CAGCGGTTTG GAAAGAGAAA AAGAATTTGG" AAATAAAAdA TGTCTACGTT ATCACCTCCT 5640 
CCAGCATTTT GGTTTTTAAT TATGTCAATA ACTGGCTTAG ATTTGGAAAT GAGAGGGGGT 5700 
TGGGTGTATT ACCGAGGAAC AAAGGAAGGC TTATATAAAC TCAAGTCTTT TATTTAGAGA 5760 
ACTGGCAAGC TGTCAAAAAC AAAAAGGCCT TACCACCAAA TTAAGTGAAT AGCCGCTATA 58 20 
GCCAGCAGGG CCAGCACGAG GGATGGTGCA CTGCTGGCAC TATGCCACGG CCTGCTTGTG 5880 
ACTCTGAGAG CAACTGCTTT GGAAATGACA GCACTTGGTG CAATTTCCTT TGTTTCAGAA 5940 
TGCGTAGAGC GTGTGCTTGG CGACAGTTTT TCTAGTTAGG CCACTTCTTT TTTCCTTCTC 6000 
TCCTCATTCT CCTAAGCATG TCTCCATGCT GGTAATCCCA GTCAAGTGAA CGTTCAAACA 6060 
ATGAATCCAT CACTGTAGGA TTCTCGTGGT GATCAAATCT TTGTGTGAGG TCTATAAAAT 6120 
ATGGAAGCTT ATTTATTTTT CGTTCTTCCA TATCAGTCTT CTCTATGACA ATTCACATCC 6180 
ACCACAGCAA ATTAAAGGTG AAGGAGGCTG GTGGGATGAA GAGGGTCTTC TAGCTTTACG 6240 
TTCTTCCTTG CAAGGCCACA GGAAAATGCT GAGAGCTGTA GAATACAGCC TGGGGTAAGA 6300 
AGTTCAGTCT CCTGCTGGGA CAGCTAACCG CATCTTATAA CCCCTTCTGA GACTCATCTT 6360 
AGGACCAAAT AGGGTCTATC TGGGGTTTTT GTTCCTGCTG TTCCTCCTGG AAGGCTATCT 6420 
CACTATTTCA CTGCTCCCAC GGTTACAAAC CAAAGATACA GCCTGAATTT TTTCTAGGCC 6480 
ACATTACATA AATTTGACCT GGT AC CAATA TTGTTCTCTA TATAGTTATT TCCTTCCCCA 6540 
CTGTGTTTAA CCCCTTAAGG CATTCAGAAC AACTAGAATC ATAGAATGGT TTGGATTGGA 6600 
AGGGGCCTTA AACATCATCC ATTTCCAACC CTCTGCCATG GGCTGCTTGC CACCCACTGG 6660 
CTCAGGCTGC CCAGGGCCCC ATCCAGCCTG GCCTTGAGCA CCTCCAGGGA TGGGGCACCC 6720 
ACAGCTTCTC TGGGCAGCCT GTGCCAACAC CTCACCACTC TCTGGGTAAA GAATTCTCTT 6780 
TTAACATCTA ATCTAAATCT CTTCTCTTTT AGTTTAAAGC CATTCCTCTT TTTCCCGTTG 6840 
CTATCTGTCC AAGAAATGTG TATTGGTCTC CCTCCTGCTT ATAAGCAGGA AGTACTGGAA 6900 
GGCTGCAGTG AGGTCTCCCC ACAGCCTTCT CTTCTCCAGG CTGAACAAGC CCAGCTCCTT 6960 
CAGCCTGTCT TCGTAGGAGA TCATCTTAGT GGCCCTCCTC TGGACCCATT CCAACAGTTC 7020 
CACGGCTTTC TTGTGGAGCC CCAGGTCTGG ATGCAGTACT TCAGATGGGG CCTTACAAAG 7080 
GCAGAGCAGA TGGGGACAAT CGCTTACCCC TCCCTGCTGG CTGCCCCTGT TTTGATGCAG 7140 
CCCAGGGTAC TGTTGGCCTT TCAGGCTCCC AGACCCCTTG CTGATTTGTG TCAAGCTTTT 7200 
CATCCACCAG AACCCACGCT TCCTGGTTAA TACTTCTGCC CTCACTTCTG TAAGCTTGTT 72 60 
TCAGGAGACT TCCATTCTTT AGGACAGACT GTGTTACACC TACCTGCCCT ATTCTTGCAT 7320 
ATATACATTT CAGTTCATGT TTCCTGTAAC AGGACAGAAT ATGTATTCCT CTAACAAAAA 7380 
TACATGCAGA ATTCCTAGTG CCATCTCAGT AGGGTTTTCA TGGCAGTATT AGCACATAGT 7440 
CAATTTGCTG CAAGTACCTT .CCAAGCTGCG GCCTCCCATA AATCCTGTAT TTGGGATCAG 7500 
TTACCTTTTG GGGTAAGCTT TTGTATCTGC AGAGACCCTG GGGGTTCTGA TGTGCTTCAG 7560 
CTCTGCTCTG TTCTGACTGC ACCATTTTCT AGATCACCCA GTTGTTCCTG TACAACTTCC 7620 
TTGTCCTCCA TCCTTTCCCA GCTTGTATCT TTGACAAATA CAGGCCTATT TTTGTGTTTG 7680 
CTTCAGCAGC CATTTAATTC TTCAGTGTCA TCTTGTTCTG TTGATGCCAC TGGAACAGGA 7740 
TTTTCAGCAG TCTTGCAAAG AACATCTAGC TGAAAACTTT CTGCCATTCA ATATTCTTAC 7800 
CAGTTCTTCT TGTTTGAGGT GAGCCATAAA TTACTAGAAC TTCGTCACTG ACAAGTTTAT 7860 
GCATTTTATT ACTTCTATTA TGTACTTACT TTGACATAAC ACAGACACGC ACATATTTTG 7920 
CTGGGATTTC CACAGTGTCT CTGTGTCCTT CACATGGTTT TACTGTCATA CTTCCGTTAT 7980 
AACCTTGGCA ATCTGCCCAG CTGCCCATCA CAAGAAAAGA GATTCCTTTT TTATTACTTC 8040 
TCTTCAGCCA ATAAACAAAA TGTGAGAAGC CCAAACAAGA ACTTGTGGGG CAGGCTGCCA 8100 
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TCAAGGGAGA GACAGCTGAA GGGTTGTGTA GCTCAATAGA ATTAAGAAAT AAT.-AAGCTG 8160 
TGTCAGACAG TTTTGCCTGA TTTATACAGG CACGCCCCAA GCCAGAGAGG CTGTCTGCCA 8220 
AGGCCACCTT GCAGTCCTTG GTTTGTAAGA TAAGTCATAG GTAACTTTTC TGG7GAATTG 8280 
• CGTGGAG AAT CATGATGGCA GTTCTTGCTG TTTACTATGG TAAGATGCTA . AA-.TAGGAGA 8340 
CAGCAAAGT A ACACTTGCTG CTGTAGGTGC TCTGCTATCC AGACAGCGAT GGCACTCGCA 8400 
CACCAAGATG AGGGATGCTC CCAGCTGACG GATGCTGGGG CAGTAACAGT GGGTCCCATG 8460 
CTGCCTGCTC ATTAGCATCA CCTCAGCCCT CACCAGCCCA TCAGAAGGAT CATCCCAAGC 8520 
TGAGGAAAGT TGCTCATCTT CTTCACATCA TCAAACCTTT GGCCTGACTG ATGCCTCCCG 8580 
GATGCTTAAA TGTGGTCACT GACATCTTTA TTTTTCTATG ATTTCAAGTC AGA^CCTCCG 8640 
GATCAGGAGG GAACACATAG TGGGAATGTA CCCTCAGCTC CAAGGCCAGA TC77CCTTCA 8700 
ATGATCATGC ATGCTACTTA GGAAGGTGTG TGTGTGTGAA TGTAGAATTG CCTTTGTTAT 8760 
TTTTTCTTCC TGCTGTCAGG AACATTTTGA ATACCAGAGA AAAAGAAAAG TGCTCTTCTT 8820 
GGCATGGGAG GAGTTGTCAC ACTTGCAAAA TAAAGGATGC AGTCCCAAAT GTTCATAATC 8880 
TCAGGGTCTG AAGGAGGATC AGAAACTGTG TATACAATTT CAGGCTTCTC TGA-.TGCAGC 8940 
TTTTGAAAGC TGTTCCTGGC CGAGGCAGTA CTAGTCAGAA CCCTCGGAAA CAG3AACAAA 9000 
TGTCTTCAAG GTGCAGCAGG AGGAAACACC TTGCCCATCA TGAAAGTGAA TA^CCACTGC 9060 
CGCTGAAGGA ATCCAGCTCC TGTTTGAGCA GGTGCTGCAC ACTCCCACAC TGAAACAACA 9120 
GTTCATTTTT ATAGGACTTC CAGGAAGGAT CTTCTTCTTA AGCTTCTTAA TTATGGTACA 9180 
TCTCCAGTTG GCAGATGACT ATGACTACTG ACAGGAGAAT GAGGAACTAG CTG3GAATAT 9240 
TTCTGTTTGA CCACCATGGA GTCACCCATT TCTTTACTGG TATTTGGAAA TAA7AATTCT 9300 
GAATTGCAAA GCAGGAGTTA GCGAAGATCT TCATTTCTTC CATGTTGGTG _ ACAGCACAGT 93 60 
TCTGGCTATG AAAGTCTGCT TACAAGGAAG AGGATAAAAA TCATAGGGAT AAT AAAT CT A 9420 
AGTTTGAAGA CAATGAGGTT TTAGCTGCAT TTGACATGAA GAAATTGAGA CCTCTACTGG 9480 
ATAGCTATGG TATTTACGTG TCTTTTTGCT TAGTTACTTA TTGACCCCAG CTGAGGTCAA 9540 
GTATGAACTC AGGTCTGTGG GGCTACTGGC ATGGATTGAT TACATACAAC TG7AATTTTA 9600 
GCAGTGATTT AGGGTTTATG AGTACTTTTTG CAGTAAATCA TAGGGTTAGT AA7GTTAATC 9660 
TCAGGGAAAA AAAAAAAAAG CCAACCCTGA CAGACATCCC AGCTCAGGTG GA-ATCAAGG 9720 
ATCACAGCTC AGTGCGGTCC CAGAGAACAC AGGGACTCTT CTCTTAGGAC CT77ATGTAC 9780 
AGGGCCTCAA GATAACTGAT GTTAGTCAGA AGACTTTCCA TTCTGGCCAC AGT7CAGCTG 9840 
AGGCAATCCT GGAATTTTCT CTCCGCTGCA CAGTTCCAGT CATCCCAGTT TGTACAGTTC 9900 
TGGCACTTTT TGGGTCAGGC CGTGATCCAA GGAGCAGAAG TTCCAGCTAT GG7CAGGGAG 9960 
TGCCTGACCG TCCCAACTCA CTGCACTCAA ACAAAGGCGA AACCACAAGA GTGC-CTTTTG 10020 
TTGAAATTGC £GTGTGGCCC AGAGGGGCTG CACCAGTACT GGATTGACCA CGASGCAACA 10080 
TTAATCCTCA GCAAGTGCAA.' TTTGCAGCCA TTAAATTGAA CTAACTGATA CTACAATGCA 10140 
ATCAGTATCA ACAAGTGGTT TGGCTTGGAA. GATGGAGTCT AGGGGCTCTA CASGAGTAGC 10200 
TACTCTCTAA TGGAGTTGCA TTTTGAAGCA GGACACTGTG AAAAGCTGGC CTCCTAAAGA 10260 
GGCTGCTAAA CATTAGGGTC AATTTTCCAG TGCACTTTCT GAAGTGTCTG CAG7TCCCCA 10320 
TGCAAAGCTG CCCAAACATA GCACTTCCAA TTGAATACAA TTATATGCAG GCG7ACTGGT 10380 
TCTTGCCAGC ACTGTCCTTC TCAAATGAAC TCAACAAACA ATTTCAAAGT CTAGTAGAAA 10440 
GTAACAAGCT TTGAATGTCA TTAAAAAGTA TATCTGCTTT CAGTAGTTCA GC77ATTTAT 10500 
GCCCACTAGA AACATCTTGT ACAAGCTGAA CACTGGGGCT CCAGATTAGT GG7AAAACCT 10560 
ACTTTATACA ATCATAGAAT CATAGAATGG CCTGGGTTGG AAGGGACCCC AAGGATCATG 10620 
AAGATCCAAC ACCCCCGCCA CAGGCAGGGC CACCAACCTC CAGATCTGGT AC7AGACCAG 10680 
GCAGCCCAGG GCTCCATCCA ACCTGGCCAT GAACACCTCC AGGGATGGAG CA7CCACAAC 10740 
CTCTCTGGGC AGCCTGTGCC AGCACCTCAC CACCCTCTCT GTGAAGAACT TT7CCCTGAC 10800 
ATCCAATCTA AGCCTTCCCT CCTTGAGGTT AGATCCACTC CCCCTTGTGC TA7CACTGTC 10860 
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TACTCTTGTA AAAAGTTGAT TCTCCTCCTT TTTGGAAGGT TGCAATGAGG TCTCCTTGCA 1092 0 
GCCTTCTTCT CTTCTGCAGG ATGAACAAGC CCAGCTCCCT CAGCCTGTCT TTATAGGAGA 10980 
GGTGCTCCAG CCCTCTGATC ATCTTTGTGG CCZCTCCTCTG GACCCGCTCC AAGAGCTCCA 1104 0 
CATCTTTCCT GTACTGGGGG CCCCAGGCCT GAATGCAGTA CTCCAGATGG GGCCTCAAAA '11100 
GAGCAGAGTA AAGAGGGACA ATCACCTTCC TCACCCTGCT GGCCAGCCCT CTTCTGATGG 11160 
AGCCCTGGAT ACAACTGGCT TTCTGAGCTG CAACTTCTCC TTATCAGTTC CACTATTAAA 11220 
ACAGGAACAA TACAACAGGT GCTGATGGCC AGTGCAGAGT TTTTCACACT TCTTCATTTC 11280 
GGTAGATCTT AGATGAGGAA CGTTGAAGTT GTGCTTCTGC GTGTGCTTCT TCCTCCTCAA 113 40 
ATACTCCTGC CTGATACCTC ACCCCACCTG CCACTGAATG GCTCCATGGC CCCCTGCAGC 11400 
CAGGGCCCTG ATGAACCCGG CACTGCTTCA GATGCTGTTT AATAGCACAG TATGACCAAG 11460 
TTGCACCTAT GAATACACAA ACAATGTGTT GCATCCTTCA GCACTTGAGA AGAAGAGCCA 11520 
AATTTGCATT GTCAGGAAAT GGTTTAGTAA TTCTGCCAAT TAAAACTTGT TTATCTACCA 11580 
TGGCTGTTTT TATGGCTGTT AGTAGTGGTA CACTGATGAT GAACAATGGC TATGCAGTAA 11640 
AATCAAGACT GTAGATATTG CAACAGACTA TAAAATTCCT CTGTGGCTTA GCCAATGTGG 11700 
TACTTCCCAC ATTGTATAAG AAATTTGGCA AGTTTAGAGC AATGTTTGAA GTGTTGGGAA 11760 
ATTTCTGTAT ACTCAAGAGG GCGTTTTTGA CAACTGTAGA ACAGAGGAAT CAAAAGGGGG 11820 
TGGGAGGAAG TTAAAAGAAG AGGCAGGTGC AAGAGAGCTT GCAGTCCCGC TGTGTGTACG 11880 
ACACTGGCAA CATGAGGTCT TTGCTAATCT TGGTGCTTTG CTTCCTGCCC CTGGCTGCCT 11940 
TAGGG 11945 
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SEQ ID NO: 8 

AAAGTCTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAG 60 

TTTGGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGAT 120 

GCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGC 180 

&TTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAAC 240 
CTCTACAAATGTGGTAAAATCGAT AAGGATCCGTCGAGCGGCCGC 285 
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SEQID NO: 9 



1 CGCGTGGTAGGTGGCGGGGGGTTCCCAGGAGAGCCCCCAGCGCGGACGGC 
AG^CGTCACTCACCGCT^^ 

AACCGCTGCAAGGGCACCGACGTCCAGGCGTGGATCAGAGGCTGCCGGCT 

otgSgagctgccgcgcccggcccgcccgctgcacagccc^ccgctttgc 
200 gagcgcgacgctacccgcttggcagttttaaacgcatccctcattaaaac 



TGTTCAAGAGAATGULTW^ 1 1 v-vjw-v- i + - „_ _ 

GTGCGGCGCGGGCGGAGGGACGGGGCGGGCGCGGGGCCGCCCGGCGGGTG 

soo 2?g^ctctgccggcccgcccggctcgggctgctgcggcgcttacggg 
cSgotctcgccgctgccgcttctcttctctcccgcgcaaggg^^ 
Stcgtgaagccggtagtgtacgggaacgtggcgcggtacttcgggaaga 
agagggaggaggacgggcacacgcatcagtggacggtxtacgtgaagccc 

cg^ag^ottcagcgccgcgcctgggtgcgctgtgggacacagcgagcttc 
tctcgtaggacatgtccgcctacgtgaaaa^aatccagttcaagctgcac 
• Sa^^g^aatcctctccgaggtc^tgttgcgtcgc^tttgc 
iooo ?ccgctcggtcccgctgaggctcgtcgccctcatctttctttcgtgccgc 
ag^gttaIca^ccgccgtacgagatcaccgaaacg^ 

agtacgctcagcttctcgtagtgcttcccccgtcctggcggcccggggct 
1200 gggctgctcgctgctgccggtcacagtcccgccagccgcggagctgactg 

AGCTCCCTTTCCCGCK3ACGTGTGCTCTGTGTTCGGTCAGCGAGGCTATCG 
TTGTGCTACGCCTGAACTACAGCTGTGAGAAGGCCGTGvsAAACCGCtCTC 

1400 aIIctgatttattggcgaaatggctctaaactaaatcgtctcctctcttt 
aag^ac^ 

a^gtctaataacga^tactgaatttaagtaactctgctcacgttgtatga 



800 



1600 



1800 




AAaSaGTTT^ 

TCAACCGAGCCTGACTTT ATTTAAAAAAAATTATTGA a GGTGCTGTGTA.T 
TTTGGTCCTTCCTTAGATATTTCAAGATCCTACTGCC^TGATGCAGCAAC 
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2000 G^GTGTAAGTGCAAAATGAGGATACCTTCGCCGACCGTCATTCACTACTA 
ATGTTTTCTGTGGGATGTGATCGTACAGTGAGTTTGGCTGTGTGAAATTT 
GAATAGCTTGGTATTGGCAGTGATGACGTGATCGATGCCTT6CTTATCAT 
GTTTGAAATGAAGTAGAATAAATGCAGCCTGCTTTATTTGAGATAGTTTG 

2200 GTTCATTTTATGGAATGCAAGCAAAGATTATACTTCCTCACTGAATTGCA 
CTGTCCAAAGGTGTGAAATGTGTGGGGATCTQGAGGACCGTGACCGAGGG 
ACATTGG^TCGCTATCTCCCATTTCTTTTGCTGTTACCAGTTCAGATTTT 



2400 



2600 



2800 



ACAT T GGATCGCTATCTC(JCAH i i.uu^iui * w " 

CTTTTCACCTAGTCTTTAATTCCCAGGGTTTTGTTTTTTCCTTGGTCATA 

GTTTTTGTTTTTCACTCTGGCAAATGATGTTGTGAATTACACTGCTTCAG 

CCACAAAACTGATGGACTGAATGAGGTCATCAAACAAACTTTTCTTCTTC 

~ — ^^,rr.m»i'TimT"PTr>/-T'/-'r'APTTaTraTTTTTACTGCTGTTGTTGAG 



ACCAGGG^AAAGCTGGAAGCTGCCAAAAAGAAAACCAGTTTTGAAATTGC 
TGAG r TTAAAGAA^.GGTTAAAAGCAAGTCGTGAAACCATCAACTGCTTAA 
AGAGTGAAATCAGAAAACTCGAAGAGGATGATCAGTCTAAAGATATGTGA 




3000 



3200 



CGTGGAGTTGTATGCGTTCT CTCC AATTUTU i flfti-^un^ 
TTCa TTTGCAAATCACTGCAGTGTGTGACAACTGACTTTTTATAAATGGC 
A GA?^CA^GAATGAATGTATCCTCATTTTATAGTTAAAATCTATGGGTA 
TGTACTGGTTTATTTCAAGGAGAATGGATCGTAGAGACTTGGAGGCCAGA 
TTGCTGCTTGTATTGACTGCATTTGAGTGGTGTAGGAACATTTTGTCTAT 
GGTCCCGTGTTAGTTTACAGAATGCCACTGTTCACTGTTTTGTTTTGTAT 
TTT^CTTTTTCTACTGCAACGTCAAGGTTTTAAAAGTTGAAWVTAAAACA 
TGCA3GTTTTTTTTAAATATTTTTTTGTCTCTATCCAGTTTGGGCTTCAA 
GTATTATTGTTAACAGCAAGTCCTGATTTAAGTCAGAGGCTGAAGTGTAA 
TGGTATTCAAGATGCTTAAGTCTGTTGTCAGCAAAACAAAAGAGAAAACT 
3400 TCAT a AAATCAGGAAGTTGGCATTTCTAATAACTTCTTTATCAACAGATA 
AGAGTTTCTAGCCCTGCATCTACTTTCACTTATGTAGTTGATGCCTTTAT 
ATTTTGTGTGTTTGGATGCAGGAA.GTGATTCCTACTCTGTTATGTAGATA 
TTCTATTTAACACTTGTACTCTGCTGTGCTTAGCCTTTCCCCATGAAAAT 
' ™„ . ^^-^^^/-TTPTTTT^TAat-CTCATACAGATGGCAG 



3600 



3800 




ACCC T CAGGCTTATAAAGGCTTGGGLAi i- i i>-J. i iftuuun 

TGTG^TGCAGTAACCTCTGCCAGAGAGGAGAAAAGCCCCACAAACCTCAT 

CCCCrrCTTCTATAGCAATCAGTATTACTAATGCTTTGAGAACAGAGCAC 

TGGT^TGAAACGTTTGATAATTAGCATTTAACATGGCTTGGTAAAGATGC 

AGAACTGAAACAGCTGTGACAGTATGAACTCAGTATGGAGACTTCATTAA 

GACAAACAGCTGTTAAAATCAGGCATGTTTCATTGAGGAGGACGGGGCAA 

CTTGCACCAGTGGTGCCCACACAAATCCTTCCTGGCGCTGCAGACCAATT 
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5600 



CAGCACTTCCACCTGAAG^ 



TTATTAAAC*^^^ 
CCAGTGATGACCGTGTCC^CCT^ 

TGTAGTAGTTAGAGCATTCAACCTCTAG 
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, n o i SSS^tSctttgaaatac^gcatgcxsagcttgtctc^ttg 

^aaTPCMAGTTCCACTGCAAAATGTCAGGAAATGGATGTCTCTCAGAAT 

^^SSStttatatgtgtatatagtaagcagtttcctgat 
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2251 TCCAGCAGGCCAAAGAGTCTGCTGAATGTTGTGTTGCCGGAGACCTGTAT 
TTCTCAACAAGGTAAGATGGTATCCTAGCAACTGCGGATTTTAATACATT 
TTCAGCAGAAGTACTTAGTTAATCTCTACCTTTAGGGATCGTTTCATCAT 

2401 TTTTAGATGTTATACTTGAAATACTGCATAACTTTTAGCTTTC^TGGGTT 
CCTTTTTTTCAGCCTTTAGGAGACTGTTAAGCAATTTGCTGTCCAACTTT 
TGTGTTGGTCTTAAACTGCAATAGTAGTTTACCTTGTATTGAAGAAATAA 

2551 AGACCATTTTTATATTAAAAAATACTTTTGTCTGTCTTCATTTTGACTTG 
TCTGATATCCTTGCAGTGCCCATTATGTCAGTTCTGTCAGATATTCAGAC 
ATCAAAACTTAACGTGAGCTCAGTGGAGTTACAGCTGCGGTTITGATGCT 

2701 GTTATTATTTCTGAAACTAGAAATGATGTTGTCTTCA7CTGCTCATCAAA 
CACTTCATGCAGAGTGTAAGGCTAGTGAGAAATGCATACATTTATTGATA 
CTTTTTT AAAGT C AACT TTTT AT C AGATTTTTTTTT C ATTTGGAAATATA 

2851 TTGTTTTCTAGACTGCATAGCTTCTGAATCTGAAATGCAGTCTGATTGGC 
ATGAAGAAGCACAGCACTCTT CAT CTTACTTAAACTT CATTTTGGAATGA 
AGGAAGTTAAGCAAGGGCACAGGTCCATGAAATAGAGACAGTGCGCTCAG 

3001 GAGAAAGTGAACCTGGATTTCTTTGGCTAGTGTTCTAAATCTGTAGTGAG 
GAAAGTAACACCCGATTCCTTGAAAGGGCTCCAGCTTTAATGCTTCCAAA 
TTGAAGGTGGCAGGCAACTTGGCCACTGGTTATTTACTGCATTATGTCTC 

3151 AGTTTCGCAGCTAACCTGGCTTCTCCACTATTGAGCATGGACTATAGCCT 
GGCTTCAGAGGCCAGGTGAAGGTTGGGATGGGTGGAAGGAGTGCTGGGCT 
GTGGCTGGGGGGACTGTGGGGACTCCAAGCTGAGCTTGGGGTGGGCAGCA 

3301 CAGGGAAAAGTGTGGGTAACTATTTTTAAGTACTGTGTTGCAAACGTCTC 
AT CTGC AAAT ACGT AGGGTGTGT ACTCTCGAAGATT AACAGTGTGGGTT C 
AGTAATATATGGATGAATTCACAGTGGAAGCATTCAAGGGTAGATCATCT 

3451 AACGACACCAGATCATCAAGCTATGATTGGAAGCGGTATCAGAAGAGCGA 
GGAAGGTAAGCAGTCTTCATATGTTTTCCCTCCACGTAAAGCAGTCTGGG 
AAAGT AGCAC C C CTTGAGC AGAGACAAGGAAAT AATTCAGGAGCATGTGC 

3 601 TAGGAGAACTTTCTTGCTGAATTCTACTTGCAAGAGCTTTGATGCCTGGC 
TTCTGGTGCCTTCTGCAGCACCTGCAAGGCCCAGAGCCTGTGGTGAGCTG 
GAGGGAAAGATTCTGCTCAAGTCCAAGCTTCAGCAGGTCATTGTCTTTGC 

3 751 TTCTTCCCCCAGCACTGTGCAGCAGAGTGGAACTGATGTCGAAGCCTCCT 
GTCCACTACCTGTTGCTGCAGGCAGACTGCTCTCAGAAAAAGAGAGCTAA 
CT CT ATGCC AT AGT CTGAAGGT AAAATGGGTTTTA^AAAG AAAACACAA 

3 901 AGGCAAAAC CGGCTGCC CC ATGAGAAGAAAGCAGTGGTAAACATGGTAGA 
AAAGGTGCAGAAGCCCCCAGGCAGTGTGACAGGCCCCTCCTGCCACCTAG 
AGGCGGGAACAAGCTTCCCTGCCTAGGGCTCTGCCCGCGAAGTGCGTGTT 

4051 TCTTTGGTGGGTTTTGTTTGGCGTTTGGTTTTGAGATTTAGACACAAGGG 
AAGC CTGAAAGGAGGTGTTGGGCACT ATTTTGGTTTGT AAAGC CTGTACT 
TCAAATATATATTTTGTGAGGGAGTGTAGCGAATTGGCCAATTTAAAATA 

4201 AAGTTGCAAGAGATTGAAGGCTGAGTAGTTGAGAGGGTAACACGTTTAAT 
GAGATCTTCTGAAACTACTGCTTCTAAACACTTGTTTGAGTGGTGAGACC 
TTGGATAGGTGAGTGCTCTTGTTACATGTCTGATGCACTTGCTTGTCCTT 

4351 TTCCATCCACATCCATGCATTCCACATCCACGCATTTGTCACTTATCCCA 
TATCTGTCATATCTGACATACCTGTCTCTTCGTCACTTGGTCAGAAGAAA 
CAGATGTGATAATCCCCAGCCGCCCCAAGTTTGAGAAGATGGCAGTTGCT 

4501 TCTTTCCCTTTTTCCTGCTAAGTAAGGATTTTCTCCTGGCTTTGACACCT 
CACGAAATAGTCTTCCTGCCTTACATTCTGGGCATTATTTCAAATATCTT 
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TGGAGTGCGCTGCTCTCAAGTTTGTGTCTTCCTACTCTTAGAGTGAATGC 

4651 TCTTAGAGTGAAAGAGAAGGAAGAGAAGATG7TGGCCGCAGTTCTCTGAT 
GAACACACCTCTGAATAATGGCCAAAGGTGGGTGGGTTTCTCTGAGGAAC 
GGGCAGCGTTTGCCTCTGAAAGCAAGGAGCTCTGCGGAGTTGCAGTTATT 

4801 TTGCAACTGATGGTGGAACTGGTGCTTAAAGCAGATTCCCTAGGTTCCCT 
GCTACTTCTTTTCCTTCTTGGCAGTCAGTTTATTTCTGACAGACAAACAG 
CCACCCCCACTGCAGGCTTAGAAAGTATGTGGCTCTGCCTGGGTGTGTTA 

4951 CAGC T CTGC CCTGGTGAAAGGGGATT AAAACGGGCAC C ATT CAT CCCAAA 
CAGGATC CT CATTCATGGATC AAGCTGT AAGG AACTTGGGCT C CAACCT C 
AAAACATTAATTGGAGTACGAATGTA^TTA^ACTGCATTCTCGCATTCC 

5101 TAAGTCATTTAGTCTGGACTCTGCAGCATGTAGGTCGGCAGCTCCCACTT 
TCTCAAAGACCACTGATGGAGGAGTAGTAAA-ATGGAGACCGATTCAGAA 
CAACCAACGGAGTGTTGCCGAAGAAACTGATGGAAATAATGCATGAATTG 

5251 TGTGGTGGACATTTTTTTTAAATACATAAA.CTACTTCAAATGAGGTCGGA 
GAAGGTCAGTGTTTTATTAGCAGCCATAAAACCAGGTGAGCGAGTACCAT 
TTTTCTCTACAAGAAAAACGATTCTGAGCTCTGCGTAAGTATAAGTTCTC 

5401 CATAGCGGCTGAAGCTCCCCCCTGGCTGCCTGCCATCTCAGCTGGAGTGC 
AGTGCCATTTCCTTGGGGTTTCTCTCACAGCAGTAATGGGACAATACTTC 
ACAAAAATTCTTTCTTTTCCTGTCATGTGGGATCCCTACTGTGCCCTCCT 

5551 GGTTTTACGTT ACCC CCTGACTGTTC CATTCAGCGGTTTGG AAAGAGAAA 
AAGAATTTGGAAATAAAAC ATGTCTACGT T ATC ACCTC CT CCAGCATTTT 
GGTTTTTAATTATGTCAATAACTGGCTTAGA7TTGGAAATGAGAGGGGGT 

5701 TGGGTGT ATTAC CGAGGAACAAAGG AAGGCT I AT AT AAACT C AAGT CTTT 
TATTTAGAGAACTGGCAAGCTGTCAJLAAAC^AAGGCCTTACCACCAAA 
TTAAGTGAATAGCCGCTATAGCCAGCAGGGCCAGCACGAGGGATGGTGCA 

5851 CTGCTGGCACTATGCCACGGCCTGCTTGTGACTCTGAGAGCA^CTGCTTT 
GGAAATGACAGCACTTGGTGCAATTTCCTTTGTTTCAGAATGCGTAGAGC 
GTGTGCTTGGCGACAGTT-TTTCTAGTTAGGCCACTTCTTTTTTCCTTCTC 

6001 TCCTCATTCTCCTAAGCATGTCTCCATGCTG3TAATCCCAGTCAAGTGAA 
CGTTCAAAGAATGAATCCATCACTGTAGGATTCTCGTGGTGATCAAATCT 
TTGTGTGAGGTCTATAAAATATGGAAGCTTATTTATTTTTCGTTCTTCCA 

6151 TATCAGTCTTCTCTATGACAATTCACATCCA.CCACAGCAAATTAAAGGTG 
AAGGAGGCTGGTGGGATGAAGAGGGTCTTCTAGCTTTACGTTCTTCCTTG 
CAAGGCCACAGGAAAATGCTGAGAGCTGTAGAATACAGCCTGGGGTAAGA 

6301 AGTTCAGT CT C CTGCTGGGAC AGCT AAC CGCATCTTAT AACCCCTTCTGA 
GACTCATCTTAGGACCAAATAGGGTCTATCTGGGGTTTTTGTTCCTGCTG 
TTCCTCCTGGAAGGCTATCTCACTATTTCACTGCTCCCACG3TTACAAAC 

6451 CAAAGATACAGCCTGAATTTTTTCTAGGCCACATTACATAAATTTGACCT 
GGTACCAATATTGTTCTCTATATAGTTATTTCCTTCCCCACTGTGTTTAA 
CCCCTTAAGGCATTCAGAACAACTAGAATCATAGAATGGTTTGGATTGGA 

6601 AGGGGCCTTAAACATCATCCATTTCCAACCCTCTGCCATGGGCTGCTTGC 
CACCCACTGGCTCAGGCTGCCCAGGGCCCCaTCCAGCCTGGCCTTGAGCA 
CCTCCAGGGATGGGGCACCCACAGCTTCTCTGGGCAGCCTGTGCCAACAC 

6751 CTCACCACTCTCTGGGTAAAGAATTCTCTTTTAACATCTAATCTAAATCT 
CTTCTCTTTTAGTTTAAAGCCATTCCTCTTTTTCCCGTTGCTATCTGTCC 
AAGAAATGTGTATTGGTCTCCCTCCTGCTTATAAGCAGGAAGTACTGGAA 

6901 GGCTGCAGTGAGGTCTCCCCACAGCCTTCTCTTCTCCAGGCTGAACAAGC 
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CCAGCTCCTTCAGCCTGTCTTCGTASC-AGATCATCTTAGTGGCCCTCCTC 
TGG^CCCATTCCAACAGTTCCACGGCTTTCTTGTGGAGCCCCAGGTCTGG 
7051 ATGCAGTACTTCAGATGGGGCCITAC^AGGCAGAGCAGATGGGGACAAT 

ScSacccctccctgctggctgcccc^ 

TGTTGGCCTTTCAGGCTCCCA.G.-.CCCCTTGCTGATTTGTGTCAAGCTTTT 

7201 Stccaccagaacccacgcttcctggttaatacttctgccctcacttctg 
^Igcttgtttcaggagacttc^ttctttaggacagactgtgttacacc 
tacctgccctattcttgcatatatacatttcagttcatgtttcctgtaac 

7351 agg^cagaatatgtattcctct.^avaaatacatgcagaattcctagtg 
JStctcagtagggttttcatggcagtattagcacatagtcaatttgctg 
caagtaccttccaagctgcggcctcccataaatcctgtatttgggatcag 

7501 ttaccttttggggtaagcttttgtatctgcagagaccctgggggttctga 

TGTGCTTCAGCTCTGCTCTGTTCTGACTGCACCATTTTCTAGATCACCCA 
GTTGTTCCTGTACAACTTCCTTGTCCTCCATCCTTTCCCAGCTTGTATCT 

7651 TTGACAAATACAGGCCTATTTTTGTGTTTGCTTCAGCAGCCATTTAATTC 
^CAGTGTCATCTTGTTCTGTTGATGCC^CTGGAACAGGATTTTCAGCAG 
TC^TGCAAAGAACATCTAGCTGAAA^CTTTCTGCCATTCAATATTCTTAC 

7801 CAGTTCTTCTTGTTTGAGGTGAGCCATA^ATTACTAGAACTTCGTCACTG 

Saagtttatgcattttattacitctattatgtacttac™ 

ACAGACACGCACATATTTTGCTGGGATTTCCACAGTGTCTCTGTGTCCTT 

7951" cacatggttttactgtcatacttccgttataaccttggcaatctgcccag 

CTGCCCATCACAAGAJVAAGAGATTCCTTTTTTATTACTTCTCTTCAGCCA 

SaSc^^gaga^gccoaacaagaacttgtckmgcaggctgcca 

8101 TCAAGGGAGAGACAGCTGAA&GGITGTGT-AGCTCAATAGAATTAJ^GAAAT 

GCCAGAGAGGCTGTCTGCC^GSCCACCTTGCAGTCCTTGGTTTGTAAGA 
8251 TAAGTC^TAGGTAACTTTTCTGGTGAATTGCGTGGAGAATCATGATGGCA 
G^CTTGCTGTTTACTATGGTA-.GATGCTAAAATAGGAGACAGCAAAGTA. 
ACACTTGCTGCTGTAGGTGCiCTGCTATCCAGACAGCGATGGCACTCGCA 

8401 SaIg\tgagggatgctcccagctgacggatgctgggg^ 

Sgtcccatgctgcctgctcattagcatcacctcagccctcaccagccca 
^gaaggLcatcccaagctgax-aa^ttgctcatcttcttcacatca 

1 LAunnwft i ^rrwaTrtPTT AAATGTGGTCACT 



8 




ga^cIcSLtgggaatgtaccctcagctccaaggccagatcttccttca 

8701 A^SSS^ 

cSSotSSSttcttcctktgtcac^aacattt 

8851 ^ggatccagtcccaa^^ 

agaaactgtgtatacaatttc^ggcttctctgaatgcagcttttgaaagc 

9001 tctcttcaaggtgcagcaggaggaaacaccttgcccatcatgaaagtg^ 

9001 fert.il <-"^*f «„ . ~. » Trr a nrTrPTr:TTTGAGCAGGTGCTGCAC 
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93 01 GAATTGCAAAGCAGGAGTTAGCGAAGATCTTCATTTCTTCCATGTTGGTG 
ACAGCACAGTTCTGGCTATGAAAGTCTGCTTACAAGGAAGAGGATAAAAA 

9-01 TCA T A.GGGATAATAAATCTAAGTTTGAAGACAATGAGGTTTTAGCTGCAT 
TTGACATGAAGAAATTGAGACCTCTACTGGATAGCTATGGTATTTACGTG 
TCTTTTTGCTTAGTTACfTATTGACCCCAGCTGAGGTCAAGTATGAACTC 

9551 AGGTCTCTCGGGCTACTGGOVTGGATTGATTACATACAACTGTAATTTTA 

gcagtgatttagggtttatgagtacttttgcagtaaatcatagggttagt 

AATGTTAATCTCAGGGAAAAAAAAAAAAAGCCAACCCTGACAGACATCCC 
9701 AGCTCAGGTGGAAATCAAGGATCACAGCTCAGTGCGGTCCCAGAGAACAC 
AGGGAC T CTTCTCTTAGGACCTTTATGTACAGGGCCTCAAGATAACTGAT 
'" m,™m^n tv TTPTvyirr a p AnTTCAGCTGAGGCAATCCT 



9851 




nGTAA^ACCTACTTTAiACAViU^XA^AAicH.irt^.-^.xw^^v.^-ww. 

10601 AAGGG^CCCCAAGGATCATGAAGATCCAACACCCCCGCCACAGGCAGGGC 

cac^aVcctccagatctggtactagaccaggcagcccagggctccatcca 

A^CTGGCCATGAACACCTCCAGGGATGGAGCATCCACAACCTCTCTGGGC 
' « o/-"t-^tv rrf ^rr'rr'Tr'TnTGAAGAACTTTTCCCTGAC 



10751 




s T rrs »tcTAAGCCTTCCCTC(_tiija^ i ' 

TATC^^GTCTACTCTTGTAAAAAGTTGATTCTCCTCCTTTTTGGAAGGT 




rCAGCTCCCTCAGCCivilCl i iAiftWftwwww.^ 

ATCTTTGTGGCCCTCCTCTGGACCCGCTCCAAGAGCTCCACATCTTTCCT 
11051 GTA^TGGGGGCCCCAGGCCTGAATGCAGTACTCCAGATGGGGCCTCAAAA 

CTTCTCATG^GCCCTGGATACAACTGGCTTTCTGAGCTGCAACTTCTCC 

11201 ^^^^^^^j^^^-^cTTCATTTCG^TAGATCTTAGATGAGGAA 
CGTTGA^GTTGTGCTTCTGCGTGTGCTTCTTCCTCCTCAAATACTCCTGC 
1 1 751 rTGATACCTCACCCCACCTGCCACTGAATGGCTCCATGGCCCCCTGCAGC 

TATGACCA^TTCCACCTATGAATACAC^AACAATGTGTTGCATCCTTCA 
11501 GCAC^GAGAAGAAGAGCCAAATTTGCATTGTCAGGAAATGGTTTAGTAA 
TTCTGCCAATTAAAACTTGTTTATCTACCATGGCTGTTTTT ATGGCTGTT 
AGTAGTGOTACACTGATGATGAACAATGGCTATGCAGTAAAATCAAGACT 
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11651 GT a ^ V 7 a TTGC AACAGACTAT AAAATTCCTCTGTGGCTTAGCC AATGTGG 
TAC-^CCCACATTGTATAAGAAATTTGGCAAGTTTAGAGCAATGTTTGAA 
GTGITGGGAAAJTTCTGTATACTCAAGAGGGCGTTTTTGACAACTGTAGA 

11 801 ac a ^-gg aatcaaaagggggtgggaggaagttaaaagaagaggcaggtgc 
aagXgagcttgcagtcccgctgtgtgtacgacactggcaacatgaggtct 
ttg'wTaatcttggtgctttgcttcctgcccctggctgccttagggtgcga 

11 951 tctgcctcagacccacagcctgggcagcaggaggaccctgatgctgctgg 

C T C i .GATGAGGAGAATCAGCCTGTTTAGCTGCCTGAAGGATAGGCACGAT 
TTTGGCTTTCCTCAAGAGGAGTTTGGCAACCAGTTTCAGAAGGCTGAGAC 

12 101 CATCCCTGTGCTGCACGAGATGATCCAGCAGATCTTTAACCTGTTTAGCA 
CCAAGGATAGCAGCGCTGCTTGGGATGAGACCCTGCTGGATAAGTTTTAC 
ACCGAGCTGTACCAGCAGCTGAACGATCTGGAGGCTTGCGTGATCCAGGG 

12251 CGTGGGCGTGACCGAGACCCCTCTGATGAAGGAGGATAGCATCCTGGCTG 
TC-a 3GAAGTACTTTCAGAGGATCACCCTGTACCTGAAGGAGAAGAAGTAC 
AGCCCCTGCGCTTGGGAAGTCGTGAGGGCTGAGATCATGAGGAGCTTTAG 

12^01 CCTGAGCACCAACCTGCAAGAGAGCTTGAGGTCTAAGGAGTAAAAAGTCT 
AG a GTCGGGGCGGCGCGTGGTAGGTGGCGGGGGGTTCCCAGGAGAGCCCC 
CAGCGCGGACGGCAGCGCCGTCACTCACCGCTCCGTCTCCCTCCGCCCAG 

12551 GGTCGCCTGGCGCAACCGCTGCAAGGGCACCGACGTCCAGGCGTGGATCA 
GAGGCTGCCGGCTGTGAGGAGCTGCCGCGCCCGGCCCGCCCGCTGCACAG 
CCG-'-CGCTTTGCGAGCGCGACGCTACCCGCTTGGCAGTTTTAAACGCAT 

12701 CCC^CATTAAAACGACTATACGCAAACGCCTTCCCGTCGGTCCGCGTCTC 
TT T CCGCCGCCAGGGCGACACTCGCGGGGAGGGCGGGAAGGGGGCCGGGC 
GGGi'iCCCGCGGCCAACCGTCGCCCCGTGACGGCACCGCCCCGCCCCCGT 

12851 GACGCGGTGCGGGCGCCGGGGCCGTGGGGCTGAGCGCTGCGGCGGGGCCG 
GGCCGGGCCGGGGCGGGAGCTGAGCGCGGCGCGGCTGCGGGCGGCGCCCC 
CTC r GGTGCAATATGTTCAAGAGAATGGCTGAGTTCGGGCCTGACTCCGG 

13001 GGG " - GGGTGAAGGTGCGGCGCGGGCGGAGGGACGGGGCGGGCGCGGGGC 
CG^CCGGCGGGTGCCGGGGCCTCTGCCGGCCCGCCCGGCTCGGGCTGCTG 
CGG-GCTTACGGGCGCGCTTCTCGCCGCTGCCGCTTCTCTTCTCTCCCGC 

13151 GC^ -"-GGCGTCACCATCGTGAAGCCGGTAGTGTACGGGAACGTGGCGCGG 
TAC-^CGGGAAGAAGAGGGAGGAGGACGGGCACACGCATCAGTGGACGGT 
TTi^-TG^AGCCCTACAGGAACGAGGTAGGGCCCGAGCGCGTCGGCCGCC 

13301 GT^C-CGGAGCGCCGGAGCCGTCAGCGCCGCGCCTGGGTGCGCTGTGGGA 
C a C- "'"GAGCTTCTCTCGTAGGACATGTCCGCCTACGTGAAAAAAATCCA 
GtV^GCTGCACGAGAGCTACGGGAATCCTCTCCGAGGTGGGTGTTGCG 

13^51 TCGG^-GGTTTGCTCCGCTCGGTCCCGCTGAGGCTCGTCGCCCTCATCTT 
TC TTTCGTGCCGCAGTCGTTACCAAACCGCCGTACGAGATCACCGAAACG 

13551 GGCTC-GGGCGAATTTGAAATCATCATCAAGATATTTTTCATTGATCCAAA 
CG^-CGACCCGTAAGTACGCTCAGCTTCTCGTAGTGCTTCCCCCGTCCTG 
GCG : -CCCGGGGCTGGGCTGCTCGCTGCTGCCGGTCACAGTCCCGCCAGCC 

13701 GCC^GCTGACTGAGCTCCCTTTCCCGGGACGTGTGCTCTGTGTTCGGTC 
AGCG-GGCTATCGGGAGGGCTTTGGCTGCATTTGGCTTCTCTGGCGCTTA 
GCGCAGGAGCACGTTGTGCTACGCCTGAACTACAGCTGTGAGAAGGCCGT 

13851 GG a - a CCGCTCTCAAACTGATTTATTGGCGAAATGGCTCTAAACTAAATC 
GTC"CCTCTCTTTGGAAATGCTTTAGAGAAGGTCTCTGTGGTAGTTCTTA 
TGC ^ TCT ATCCTAAAGCACTTGGCCAGACAATTTAAAGACATCAAGCAGC 
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i/iOOl ATTTATAGCAGGCACGTTTAATAACGAATACTGAATTTAAGTAACTCTGC 
TCACGTTGTATGACGTTTATTTTCGTATTCCTGAAAGCCATTAAAATCCT 
GTGCAGTTGTTTAGTAAGAACAGCTGCCACTGTTTTGTATCTAGGAGATA 
- ^151 ACTGGTGTTTCCCTACAGTTCTCAAGCTGATAAAACTCTGTCTTTGTATC 
TAGGTAACCCTGTATCACTTGCTGAAGCTTTTTCAGTCTGACACCAATGC 
AATCCTGGGAAAGAAAACTGTAGTTTCTGAATTCTATGATGAAATGGTAT 
1/1301 GAAAATTTTAATGTCAACCGAGCCTGACTTTATTTAAAAAAAATTATTGA 
TGGTGCTGTGTATTTTGGTCCTTCCTTAGATATTTCAAGATCCTACTGCC 
ATGATGCAGCAACTGCTAACGACGTCCCGTCAGCTGACACTTGGTGCTTA 
14*51 CAAGCATGAAACAGAGTGTAAGTGCAAAATGAGGATACCTTCGCCGACCG 
TCATTCACTACTAATGTTTTCTGTGGGATGTGATCGTACAGTGAGTTTGG 
CTGTGTGAAATTTGAATAGCTTGGTATTGGCAGTGATGACGTGATCGATG 
1 1 601 CCTTGCTTATCATGTTTGAAATGAAGTAGAATAAATGCAGCCTGCTTTAT 
TTGAGATAGTTTGGTTCATTTTATGGAATGCAAGCAAAGATTATACTTCC 
TCACTGAATTGCACTGTCCAAAGGTGTGAAATGTGTGGGGATCTGGAGGA 
1*751 CCGTGACCGAGGGACATTGGATCGCTATCTCCCATTTCTTTTGCTGTTAC 
CAGTTCAGATTTTCTTTTCACCTAGTCTTTAATTCCCAGGGTTTTGTTTT 
TTCCTTGGTCATAGTTTTTGTTTTTCACTCTGGCAAATGATGTTGTGAAT 
i *901 TACACTGCTTCAGCCACAAAACTGATGGACTGAATGAGGTCATCAAACAA 
ACTTTTCTTCTTCCGTATTTCCTTTTTTTTCCCCCACTTATCATTTTTAC 
TGCTGTTGTTGAGTCTGTAAGGCTAAAAGTAACTGTTTTGTGCTTTTTCA 
15051 GG ACGTGTGCTTTCCAAATTACTGCCACATATATAAAGAAAGGTTGGAAT 
TTTAAAGATAATTCATGTTTCTTCTTCTTTTTTGCCACCACAGTTGCAGA 
TCTTGAAGTAAAAACCAGGGAAAAGCTGGAAGCTGCCAAAAAGAAAACCA 
15201 GTTTTGAAATTGCTGAGCTTAAAGAAAGGTTAAAAGCAAGTCGTGAAACC 
ATCAACTGCTTAAAGAGTGAAATCAGAAAACTCGAAGAGGATGATCAGTC 
TAA^GATATGTGATGAGTGTTGACTTGGCAGGGAGCCTATAATGAGAATG 
15351 AAAGGACTTCAGTCGTGGAGTTGTATGCGTTCTCTCCAATTCTGTAACGG 
AGACTGTATGAATTTCATTTGCAAATCACTGCAGTGTGTGACAACTGACT 
TTTTATAAATGGCAGAAAACAAGAATGAATGTATCCTCATTTTATAGTTA 
15501 AAATCTATGGGTATGTACTGGTTTATTTCAAGGAGAATGGATCGTAGAGA 
CTTGGAGGCCAGATTGCTGCTTGTATTGACfGCATTTGAGTGGTGTAGGA 
ACATTTTGTCTATGGTCCCGTGTTAGTTTACAGAATGCCACTGTTCACTG 
15651 TTTTGTTTTGTATTTTACTTTTTCTACTGCAACGTCAAGGTTTTAAAAGT 
' • •TGAAAATAAAACATGCAGGTTTTTTTTAAATATTTTTTTGTCTCTATCCA 
GTTTGGGCTTCAAGTATTATTGTTAACAGCAAGTCCTGATTTAAGTCAGA 
15801 GGCTGAAGTGTAATGGTATTCAAGATGCTTAAGTCTGTTGTCAGCAAAAC 
AAAAGAGAAAACTTCATAAAATCAGGAAGTTGGCATTTCTAATAACTTCT 
TTATCAACAGATAAGAGTTTCTAGCCCTGCATCTACTTTCACTTATGTAG 
i 5951 TTGATGCCTTTATATTTTGTGTGTTTGGATGCAGGAAGTGATTCCTACTC 
TGTTATGTAGATATTCTATTTAACACTTGTACTCTGCTGTGCTTAGCCTT 
i 6051 TCCCCATGAAAATTCAGCGGCTGTAAATCCCCCTCTTCTTTTGTAGCCTC 
ATACAGATGGCAGACCCTCAGGCTTATAAAGGCTTGGGCATCTTCTTTAC 
TGCTTTGAGATTCTGTGTTGCAGTAACCTCTGCCAGAGAGGAGAAAAGCC 
16201 CCACAAACCTCATCCCCTTCTTCTATAGCAATCAGTATTACTAATGCTTT 
GAGAACAGAGCACTGGTTTGAAACGTTTGATAATTAGCATTTAACATGGC 
TTGGTAAAGATGCAGAACTGAAACAGCTGTGACAGTATGAACTCAGTATG 
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16501 ££3S£E£E^^ 

CTTATTAAAAAC^TAAATGAAAGACAAATTAGCTTTGCTTGGGTGCaCAG 

16751 ScC^^ 

TTAAATTTGCTTTTAAGCTGTAGCTGAAAAAGAACGTGCTGTCTTCC _C 

, «.« ^r^rrTGGCAGCTCTGTGCAAAGTGCTCTCTGGTCTCACCAGCCTTT, 

16851 5^^^^5?^™^T^nr ACGTCTGAGAGGGCTCAGAGTGGCTTCGTTTG 



TTTGAACAGCGTGTACTGCTTTCTGTAGA^A'l^^'- * 
, S^TCAAAOTGTTCACACTGAACACACTGGAACAGGTTGCC^GQA 
17001 ^^^^^^^^QQ^^^cCTGGAGGCATTCAAGGCCAGGCTGGATGTGG 

i7i 5 iSSSSS^ 

17151 TTCIATG^TOCAACAGCAAATCATATGTACTGAGAGAGGAAACAAACACA 
TTCTATGA1 i ^ i-imrwarprr atttgGTAAAAGAGTCAGGTTTTA 



17451' 



18051 SS5SSSS5SS5SSSSS£S5ic^ 
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GGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCG-CACTGACTCGCT 

SgScggt^ 

GCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAj« ltCGTTGCTGGC 

18951 CA^GTCAGAGGTGGCGAAACCCGACAG^ 

CCCCCTC^AAGCTCCCTCGTGCGCTCTCCTGTTCCo.^ 

CGGAT ACCTGTC CGCCTTTCTCCCTTCGGGAAGCG i wGCTTTCTCATA 

rrrTrTGTGCACGAACCCCCCGTTCAGCCCGACCGw.^wGCCTTATCLtarva 
TAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACITATCGC^CTGG 
192 51 CAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGCA.TGTAGGCGGTGCT 

AT^TATCTGCGCTCTGCTGAAGCCAG^ 
i o^i rva-rTrTTGATCCGGCAAACAAACCACCGCTGGi.-.o^-GTGGTTTTTTl 

i aa^GGATTTTGGTCATGAGATTATCAAAAAGGATC - i -r-.CCTAGATCCTT 
TTr-TCTGJVCAGTTACCAATGCTTAATCAGTGAGGC.-.-uTATCTCAGCGA 
n ,701 tSSSS^ 

acKcgatIcggga^ 

GC^AGACCCACGCTCACCGGCTCCAGATTT^ 
19851 CCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCA^ 

CAGTCTATTA^TTGTTGCCGGGAAGCTAGAGTAAG^«ji^GCCAGT^A 

T&^TTTGCGCAACGTTGTTGCCATTGCTACAGGCA — GGTGTCACGCT 

.oso^SS^aaTgS^^c^S 

ITGAATGTATTT 




9 0 601 AC^^AAATAAACAA^TAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCA 

TA '^AGCTCATTTTTTAACCAATAGGCCGAAATCC-GCA^AATCCCTTA 
9 0751 T^^TCAAAAG^ATAGACCGAGATAGGGTTGAGTGi TGTTCCAGTTTGGA. 

20751 SSSJ^™*^^ 

ACCGTCTATCAGGGCGATGGCCCACTACGTGAACC^C--.CC^ 
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21051 GGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTAC 
AGGGCGCGTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGA 
TCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGC 

21201 TGCAAGGCGAT TAAGTTGGGTAACGCC^.GGGTTTTCC CAGTCACGACGTT 
GTAAAACGACGGCCAGTGAATTGTA^TACGACTCACTATAGGGCGAATTG 

213 01 GAGCTCCACCGCGGTGGCGGCCGCTCTAG 
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SEQ ID NO: 11 

CAGGCACGATGGTGCTGAGCCTTAGCTGCTiC^ 

GAAGGACCTGTCCCTTACTCCCCTCA^ 
AAGAGGTTTTTTTTTTTTTTGGTCCAAAAG^CTGTTTG 

GTGACACTTGTCTCAAGCTATTAACCAAGTGT^ 
TTTTCCATTTGAAGCCCCTTGCAA^CA^GAGCA^ 

GAAGGGTTTTGGTGCCAAAGAG^GA^ 
CAGAGGGGACGGTCTGTGAATCAAGCTT 
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SEQ ID NO. 14 
I FN -A 

ATGGCTTTGA CCTTTGCCTT ACTGGTGGCT 
TCTGTGGGCT GCGATCTGCC TCA 



CTCCTGGTGC TGAGCTGCAA GAGCAGCTGC 



SEQ ID NO. 15 
IFN-B ' 

GACCCACAGC CTGGGCAGCA 

CCTGTTTAGC TGCCTGAAGG 



GGAGGACCCT GATGCTGCTG 
ATAGGCACGA TTTTGGCTTT 



GCTCAGATGA GGAGAATCAG 



SEQ ID NO. 16 
IFN-C 

CTCAAGAGGA GTTTGGCAAC CAGTTTCAGA AGGCTGAGAC CATCCCTGTG CTGCACGAGA 
TG 

SEQ ID NO. 17 
IFN-D 

TCCAGCAGAT CTTTAACCTG TTTAGCACCA AGGATAGGAG CGCTGCTTGG GATGAGACCC 
TGCTGGATAA GTTTTACACC GAGCTGTACC AGCA 

SEQ ID NO. 18 
IFN-E 

CTGAACGATC TGGAGGCTTG CGTGATCCAG GGCGTGGGCG TGACCGAGAC CCCTCTGATG 
AAGGAGGATA GCATCCT 

SEQ ID NO. 19 
UFN-F 

GCTGTGAGGA AGTACTTTCA GAGGATCACC CTGTACCTGA AGGAGAAGAA GTACAGCCCT 
TGCGCTTGGG AAGTCGTGAG GG 



SEQ ID NO. 20 
IFN-G 

CTGAGATCAT GAGGAGCTTT AGCCTGAGCA CCAACCTGCA AGAGAGCTTG AGGTCTAAGG 
AGTAA 

SEQ ID NO. 21 
IFN-1 

CCCAAGCTTT CACCATGGCT TTGACCTTTG CCTT 



SEQ ID NO. 22 
IFN-2b 

ATCTGCCTCA GACCCACAG 
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SEQ ID NO. 23 
IFN-3c 

GATTTTGGCT TTCCTCAAGA GGAGTT 

SEQ ID NO. 24 
IFN-4b 

GCACGAGATG ATCCAGCAGA T 

SEQ ID NO. 25 
I FN- 5 

ATCGTTCAGC TGCTGGTACA 

SEQ ID NO. 26 
IFN-6 

CCTCACAGCC AGGATGCTAT 

SEQ ID NO. 27 
IFN-7 

ATGATCTCAG CCCTCACGAC 

SEQ ID- NO. 28 
IFN-2 

CTGT-GGGTCT- GAGGGAGAT 

SEQ ID NO. 29 
IFN-3b 

AACTCCTCTT GAGGAAAGCC AAAATC 

SEQ ID NO. 30 
I FN- 4 

ATCTGCTGGA TCATCTCGTG C 

SEQ ID NO. 31 
I FN- 8 

TGCTCTAGAC TTTTTACTCC TTAGACCTCA AGCTCT 
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01igoV%CACTCGAGG TGAATATCCA AGAAT 
01igo D 2 N °'GAGATCGATT TTGGCTGGAC ACTTG 
Oligo D 3 N °*CACATCGATG TCACAACTTG GGAAT 
Sgo D 4 N0 'TCTAAGCTTC GTCACAGACC GTCCC 
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SEQUENCE LISTING 

<110> AviGenics, Inc. 

<120> Production of Transgenic Avians 
Using Sperm-mediated Transfection 

<130> 1110*6-021-228 

<140> To be assigned 
<141> 2002-09-18 

<150> 60/324,001 
<151> 2001-09-21 

<150> 60/323,961 
<151> 2001-09-21 

<160> 35 

<170> Patentln version 3.1 

<210> 1 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 5pLMAR2 

<400> 1 

tgccgccttc tttgatattc 



mnBmm^UnQ feb 2006 



20 



<210> 2 

<211> 20 

<-21-2» DNA 

<213> Artificial sequence 
<220> 

<223> Primer LE-6.1kbrevl 

<400> 2 

ttggtggtaa ggcctttttg zu 



<210> 3 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer lys-6.1 



<400> 3 

ctggcaagct gtcaaaaaca 



20 
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<210> 4 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer LysElrev 

<400> 4 20 
cagctcacat cgtccaaaga 

<210> 5 

<211> 498 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> IFNMAGMAX 
<220> 

<221> misc_feature 

<222> (1) .7(498) 
<223> 



Jgcgatctgc ctcagaccca cagcctgggc agcaggagga ccctgatgct gctggctcag 
atgaggagaa tcagcctgtt tagctgcctg aaggataggc acgattttgg ctttcctcaa 
gaggagtttg gcaaccagtt tcagaaggct gagaccatcc ctgtgctgca cgagatgatc 
cagcagatct ttaacctgtt tagcaccaag gatagcagcg ctgcttggga tgagaccctg 
ctggataagt tttacaccga gctgtaccag cagctgaacg atctggaggc ttgcgtgatc 
cagggcgtgg gcgtgaccga gacccctctg atgaaggagg atagcatcct gg^gtgagg 
aagtactttc agaggatcac cctgtacctg aaggagaaga agtacagccc ctgcgcttgg 
gaagtcgtga gggctgagat catgaggagc tttagcctga gcaccaacct gcaagagagc 
ttgaggtcta aggagtaa 

<210> 6 

<211> 12728 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> miscjfeature 

<lllZ 5primf Srix (scaffold) attachment region (MAR) 
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<220> 

<221> misc_feature 
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<222> (261).. (1564) 

<223> 5prime matrix (scaffold) attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (1565) (1912) 

<223> Sprime matrix (scaffold) attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (1930) (2012) , 

<223> Sprime matrix (scaffold) attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (2013) . . (2671) 

<223> Intrinsically curved DNA 



<220> 

<221> misc__feature 

<222> (5848) . . (5934) 

<223> Transcription Enhancer 



<220> 

<221> misc_feature 

<222> (9160) . . (9325) 

<223> Transcription Enhancer 



<220> 

<221> misc__feature 

<222> (9326) . . (9626) 

<223> Negative Regulatory Element 



<220> 

<221> misc_feature 

<222> (9621) . . (9660) 

<223> Hormone Response Element 



<220> 

<221> misc_feature 

<222> (9680) . . (10060) 

<223> Hormone Response Element 



<220> 

<221> misc__feature 

<222> (10576) . . (10821) 

<223> Chicken CRl Repeat Sequence 
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<220> 

<221> misc_feature 

<222> (10926) . . (11193) 

<223> Chicken CR1 Repeat Sequence 

<220> 

<221> misc_feature 

<223> iysozJie*Proxi^il Promoter and Lysozyme Signal Peptide 
<220> 

<221> misc_feature 

<223> San'k^er'feron alpha 2d encoding region codon optimized for exp 
ression in chicken cells (IFNMAGMAX) 

<220> 

<221> polyA_signal 
<222> (12444) . . (12728) 
<223> 

^gccgcc^tc tttgatattc actctgttgt atttcatctc ttcttgccga tgaaaggata 60 
taacagtctg tataacagtc tgtgaggaaa tacttggtat ttcttctgat cagtgttttt 120 
ataagtaatg ttgaatattg gataaggctg tgtgtccttt gtcttgggag acaaagccca 
cagcaggtgg tggttggggt ggtggcagct cagtgacagg agaggttttt ttgcctgttt 
tttttttttt tttttttttt aagtaaggtg ttcttttttc ttagtaaatt ttctactgga 
ctgtatgttt tgacaggtca gaaacatttc ttcaaaagaa gaaccttttg gaaactgtac 
agcccttttc tttcattccc tttttgcttt ctgtgccaat gcctttggtt ctgattgcat 
tatggaaaac gttgatcgga acttgaggtt tttatttata gtgtggcttg aaagcttgga 
tagctgttgt tacacgagat accttattaa gtttaggcca gcttgatgct ttattttttc 
cctttgaagt agtgagcgtt ctctggtttt tttcctttga aactggtgag gcttagattt 
ttctaatggg attttttacc tgatgatcta gttgcatacc caaatgcttg taaatgtttt 
cctagttaac atgttgataa cttcggattt acatgttgta tatacttgtc atctgtgttt 
ctagtaaaaa tatatggcat ttatagaaat acgtaattcc tgatttcctt tttttttatc 
tctatgctct gtgtgtacag gtcaaacaga cttcactcct atttttattt atagaatttt 
atatgcagtc tgtcgttggt tcttgtgttg taaggataca gccttaaatt tcctagagcg 
atgctcagta aggcgggttg tcacatgggt tcaaatgtaa aacgggcacg tttggctgct 
gccttcccga gatccaggac actaaactgc ttctgcactg aggtataaat cgcttcagat 
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cccagggaag tgcagatcca cgtgcatatt cttaaagaag aatgaatact ttctaaaata 
ttttggcata ggaagcaagc tgcatggatt tgtttgggac ttaaattatt ttggtaacgg 
agtgcatagg ttttaaacac agttgcagca tgctaacgag tcacagcgtt tatgcagaag 
tgatgcctgg atgcctgttg cagctgttta cggcactgcc ttgcagtgag cattgcagat 
aggggtgggg tgctttgtgt cgtgttccca cacgctgcca cacagccacc tcccggaaca 
catctcacct gctgggtact tttcaaacca tcttagcagt agtagatgag ttactatgaa 
acagagaagt tcctcagttg gatattctca tgggatgtct tttttcccat gttgggcaaa 
gtatgataaa gcatctctat ttgtaaatta tgcacttgtt agttcctgaa tcctttctat 
agcaccactt attgcagcag gtgtaggctc tggtgtggcc tgtgtctgtg cttcaatctt 
ttaaagcttc tttggaaata cactgacttg attgaagtct cttgaagata gtaaacagta 
cttacctttg atcccaatga aatcgagcat ttcagttgta aaagaattcc gcctattcat 
accatgtaat gtaattttac acccccagtg ctgacacttt ggaatatatt caagtaatag 
actttggcct caccctcttg tgtactgtat tttgtaatag aaaatatttt aaactgtgca 
tatgattatt acattatgaa agagacattc tgctgatctt caaatgtaag aaaatgagga 
gtgcgtgtgc ttttataaat acaagtgatt gcaaattagt gcaggtgtcc ttaaaaaaaa 
aaaaaaaaag taatataaaa aggaccaggt gttttacaag tgaaatacat tcctatttgg 
taaacagtta catttttatg aagattacca gcgctgctga ctttctaaac ataaggctgt 
attgtcttcc tgtaccattg catttcctca ttcccaattt gcacaaggat gtctgggtaa 
actattcaag aaatggcttt gaaatacagc atgggagctt gtctgagttg gaatgcagag 
ttgcactgca aaatgtcagg aaatggatgt ctctcagaat gcccaactcc aaaggatttt 
atatgtgtat atagtaagca gtttcctgat tccagcaggc caaagagtct gctgaatgtt 
gtgttgccgg agacctgtat ttctcaacaa ggtaagatgg tatcctagca actgcggatt 
ttaatacatt ttcagcagaa gtacttagtt aatctctacc tttagggatc gtttcatcat 
ttttagatgt tatacttgaa atactgcata acttttagct ttcatgggtt cctttttttc 
agcctttagg agactgttaa gcaatttgct gtccaacttt tgtgttggtc ttaaactgca 
atagtagttt accttgtatt gaagaaataa agaccatttt tatattaaaa aatacttttg 
tctgtcttca ttttgacttg tctgatatcc ttgcagtgcc cattatgtca gttctgtcag 
atattcagac atcaaaactt aacgtgagct cagtggagtt acagctgcgg ttttgatgct 
gttattattt ctgaaactag aaatgatgtt gtcttcatct gctcatcaaa cacttcatgc 
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agagtgtaag gctagtgaga aatgcataca tttattgata cttttttaaa gtcaactttt 2820 

tatcagattt ttttttcatt tggaaatata ttgttttcta gactgcatag cttctgaatc 2880 

tgaaatgcag tctgattggc atgaagaagc acagcactct tcatcttact taaacttcat 2940 

tttggaatga aggaagttaa gcaagggcac aggtccatga aatagagaca gtgcgctcag 3000 

gagaaagtga acctggattt ctttggctag tgttctaaat ctgtagtgag gaaagtaaca 3060 

cccgattcct tgaaagggct ccagctttaa tgcttccaaa ttgaaggtgg caggcaactt 3120 

ggccactggt tatttactgc attatgtctc agtttcgcag ctaacctggc ttctccacta 3180 

ttgagcatgg actatagcct ggcttcagag gccaggtgaa ggttgggatg ggtggaagga 3240 

gtgctgggct gtggctgggg ggactgtggg gactccaagc tgagcttggg gtgggcagca 3300 

cagggaaaag tgtgggtaac tatttttaag tactgtgttg caaacgtctc atctgcaaat 3360 

acgtagggtg tgtactctcg aagattaaca gtgtgggttc agtaatatat ggatgaattc 3420 

acagtggaag cattcaaggg tagatcatct aacgacacca gatcatcaag ctatgattgg 3480 

aagcggtatc agaagagcga ggaaggtaag cagtcttcat atgttttccc tccacgtaaa 3540 

gcagtctggg aaagtagcac cccttgagca gagacaagga aataattcag gagcatgtgc 3600 

taggagaact ttcttgctga attctacttg caagagcttt gatgcctggc ttctggtgcc 3660 

ttctgcagca cctgcaaggc ccagagcctg tggtgagctg gagggaaaga ttctgctcaa 3720 

gtccaagctt cagcaggtca ttgtctttgc ttcttccccc agcactgtgc agcagagtgg 3780 

aactgatgtc gaagcctcct gtccactacc tgttgctgca ggcagactgc tctcagaaaa 3840 

agagagctaa ctctatgcca tagtctgaag gtaaaatggg ttttaaaaaa gaaaacacaa 3900 

aggcaaaacc ggctgcccca tgagaagaaa gcagtggtaa acatggtaga aaaggtgcag 3960 

aagcccccag gcagtgtgac aggcccctcc tgccacctag aggcgggaac aagcttccct 4020 

gcctagggct ctgcccgcga agtgcgtgtt tctttggtgg gttttgtttg gcgtttggtt 4080 

ttgagattta gacacaaggg aagcctgaaa ggaggtgttg ggcactattt tggtttgtaa 4140 

agcctgtact tcaaatatat attttgtgag ggagtgtagc gaattggcca atttaaaata 4200 

aagttgcaag agattgaagg ctgagtagtt gagagggtaa cacgtttaat gagatcttct 4260 

gaaactactg cttctaaaca cttgtttgag tggtgagacc ttggataggt gagtgctctt 4320 

gttacatgtc tgatgcactt gcttgtcctt ttccatccac atccatgcat tccacatcca 4380 

cgcatttgtc acttatccca tatctgtcat atctgacata cctgtctctt cgtcacttgg 4440 

tcagaagaaa cagatgtgat aatccccagc cgccccaagt ttgagaagat ggcagttgct 4500 
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tctttccctt tttcctgcta agtaaggatt ttctcctggc tttgacacct cacgaaatag 4560 

tcttcctgcc ttacattctg ggcattattt caaatatctt tggagtgcgc tgctctcaag 4620 

tttgtgtctt cctactctta gagtgaatgc tcttagagtg aaagagaagg aagagaagat 4 680 

gttggccgca gttctctgat gaacacacct ctgaataatg gccaaaggtg ggtgggtttc 4740 

tctgaggaac gggcagcgtt tgcctctgaa agcaaggagc tctgcggagt tgcagttatt 4 800 

ttgcaactga tggtggaact ggtgcttaaa gcagattccc taggttccct gctacttctt 4 860 

ttccttcttg gcagtcagtt tatttctgac agacaaacag ccacccccac tgcaggctta 4920 

gaaagtatgt ggctctgcct gggtgtgtta cagctctgcc ctggtgaaag gggattaaaa 4 980 

cgggcaccat tcatcccaaa caggatcctc attcatggat caagctgtaa ggaacttggg 5040 

ctccaacctc aaaacattaa ttggagtacg aatgtaatta aaactgcatt ctcgcattcc 5100 

taagtcattt agtctggact ctgcagcatg taggtcggca gctcccactt tctcaaagac 5160 

cactgatgga ggagtagtaa aaatggagac cgattcagaa caaccaacgg agtgttgccg 5220 

aagaaactga tggaaataat gcatgaattg tgtggtggac atttttttta aatacataaa 5280 

ctacttcaaa tgaggtcgga gaaggtcagt gttttattag cagccataaa accaggtgag 5340 

cgagtaccat ttttctctac aagaaaaacg attctgagct ctgcgtaagt ataagttctc 5400 

catagcggct gaagctcccc cctggctgcc tgccatctca gctggagtgc agtgccattt 5460 

ccttggggtt tctctcacag cagtaatggg acaatacttc acaaaaattc tttcttttcc 5520 

tgtcatgtgg gatccctact gtgccctcct ggttttacgt taccccctga ctgttccatt 5580 

cagcggtttg gaaagagaaa aagaatttgg aaataaaaca tgtctacgtt atcacctcct 5640 

ccagcatttt ggtttttaat tatgtcaata actggcttag atttggaaat gagagggggt 5700 

tgggtgtatt accgaggaac aaaggaaggc ttatataaac tcaagtcttt tatttagaga 5760 

actggcaagc tgtcaaaaac aaaaaggcct taccaccaaa ttaagtgaat agccgctata 5820 

gccagcaggg ccagcacgag ggatggtgca ctgctggcac tatgccacgg cctgcttgtg 5880 

actctgagag caactgcttt ggaaatgaca gcacttggtg caatttcctt tgtttcagaa 5940 
tgcgtagagc gtgtgcttgg cgacagtttt tctagttagg ccacttcttt tttccttctc 
tcctcattct cctaagcatg tctccatgct ggtaatccca gtcaagtgaa cgttcaaaca 
atgaatccat cactgtagga ttctcgtggt gatcaaatct ttgtgtgagg tctataaaat 6120 
atggaagctt atttattttt cgttcttcca tatcagtctt ctctatgaca attcacatcc 6180 
accacagcaa attaaaggtg aaggaggctg gtgggatgaa gagggtcttc tagctttacg 6240 
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ttcttcottg caaggccaca ggaaaatgct gagagctgta gaatacagcc tggggtaaga 6300 
agttcagtct cctgctggga cagctaaccg catcttataa ccccttctga gactcatctt 6360 
aggaccaaat agggtctatc tggggttttt gttcctgctg ttcctcctgg aaggctatct 
cactatttca ctgctcccac ggttacaaac caaagataca gcctgaattt tttctaggcc 
acattacata aatttgacct ggtaccaata ttgttctcta tatagttatt tccttoccca 
ctgtgtttaa ccccttaagg cattcagaac aactagaatc atagaatggt ttggattgga 
aggggcctta aacatcatcc atttccaacc ctctgccatg ggctgcttgc cacccactgg 
ctcaggctgc ccagggcccc atccagcctg gccttgagca cctccaggga tggggcaccc 
acagcttctc tgggcagcct gtgccaacac ctcaccactc tctgggtaaa gaattctctt 
ttaacatcta atctaaatct cttctctttt agtttaaagc cattcctctt tttcccgttg 
ctatctgtcc aagaaatgtg tattggtctc cctcctgctt ataagcagga agtactggaa 
ggctgcagtg aggtctcccc acagccttct cttctccagg ctgaacaagc coagctcctt 
cagcctgtct tcgtaggaga tcatcttagt ggccctcctc tggacccatt ccaacagttc 
cacggctttc ttgtggagcc ccaggtctgg atgcagtact tcagatgggg ccttacaaag 
gcagagcaga tggggacaat cgcttacccc tccctgctgg ctgcccctgt tttgatgcag 
cccagggtac tgttggcctt tcaggctccc agaccccttg ctgatttgtg tcaagctttt 
catccaccag aacccacgct tcctggttaa tacttctgcc ctcacttctg taagcttgtt 
tcaggagact tccattcttt aggacagact gtgttacacc tacctgccct attcttgcat 
atatacattt cagttcatgt ttcctgtaac aggacagaat atgtattect ctaacaaaaa 
tacatgcaga attcctagtg ccatctcagt agggttttca tggcagtatt agcacatagt 
caatttgctg caagtacctt ccaagctgcg gcctcccata aatcctgtat ttgggatcag 
ttaccttttg gggtaagctt ttgtatctgc agagaccctg ggggttctga tgtgcttcag 
ctctgctctg ttctgactgc accattttct agatcaccca gttgttcctg tacaacttcc 
ttgtcctcca tcctttccca gcttgtatct ttgacaaata caggcctatt tttgtgtttg 
cttcagcagc catttaattc ttcagtgtca tcttgttctg ttgatgccac tggaacagga 
ttttcagcag tcttgcaaag aacatctagc tgaaaacttt otgccattca atattcttac 7800 
cagttcttct tgtttgaggt gagccataaa ttactagaac ttcgtcactg acaagtttat 7860 
gcattttatt acttctatta tgtacttact ttgacataac acagacacgc acatattttg 7920 
ctgggatttc cacagtgtct ctgtgtcctt cacatggttt tactgtcata cttccgttat 7980 
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aaccttggca atctgcccag ctgcccatca caagaaaaga gattcctttt ttattacttc 
tcttcagcca ataaacaaaa tgtgagaagc ccaaacaaga acttgtgggg caggctgcca 
tcaagggaga gacagctgaa gggttgtgta gctcaataga attaagaaat aataaagctg 
tgtcagacag ttttgcctga tttatacagg cacgccccaa gccagagagg ctgtctgcca 
aggccacctt gcagtccttg gtttgtaaga taagtcatag gtaacttttc tggtgaattg 
cgtggagaat catgatggca gttcttgctg tttactatgg taagatgcta aaataggaga 
cagcaaagta acacttgctg ctgtaggtgc tctgctatcc agacagcgat ggcactcgca 
caccaagatg agggatgctc ccagctgacg gatgctgggg cagtaacagt gggtcccatg 
ctgcctgctc attagcatca cctcagccct caccagccca tcagaaggat catcccaagc 
tgaggaaagt tgctcatctt cttcacatca tcaaaccttt ggcctgactg atgcctcccg 
gatgcttaaa tgtggtcact gacatcttta tttttctatg atttcaagtc agaacctccg 
gatcaggagg gaacacatag tgggaatgta ccctcagctc caaggccaga tcttccttca 
atgatcatgc atgctactta ggaaggtgtg tgtgtgtgaa tgtagaattg cctttgttat 8760 
tttttcttcc tgctgtcagg aacattttga ataccagaga aaaagaaaag tgctcttctt 8820 
ggcatgggag gagttgtcac acttgcaaaa taaaggatgc agtcccaaat gttcataatc 8880 
tcagggtctg aaggaggatc agaaactgtg tatacaattt caggcttctc tgaatgcagc 8940 
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ttttgaaagc tgttcctggc cgaggcagta ctagtcagaa ccctcggaaa caggaacaaa 
tgtcttcaag gtgcagcagg aggaaacacc ttgcccatca tgaaagtgaa taaccactgc 
cgctgaagga atccagctcc tgtttgagca ggtgctgcac actcccacac tgaaacaaca 
gttcattttt ataggacttc caggaaggat cttcttctta agcttcttaa ttatggtaca 
tctccagttg gcagatgact atgactactg acaggagaat gaggaactag ctgggaatat 
ttctgtttga ccaccatgga gtcacccatt tctttactgg tatttggaaa taataattct 
gaattgcaaa gcaggagtta gcgaagatct tcatttcttc catgttggtg acagcacagt 9360 
tctggctatg aaagtctgct tacaaggaag aggataaaaa tcatagggat aataaatcta 9420 
agtttgaaga caatgaggtt ttagctgcat ttgacatgaa gaaattgaga cctctactgg 9480 
atagctatgg tatttacgtg tctttttgct tagttactta ttgaccccag ctgaggtcaa 9540 
gtatgaactc aggtctctcg ggctactggc atggattgat tacatacaac tgtaatttta 
gcagtgattt agggtttatg agtacttttg cagtaaatca tagggttagt aatgttaatc 
tcagggaaaa aaaaaaaaag ccaaccctga cagacatccc agctcaggtg gaaatcaagg 9720 
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atcacagctc agtgcggtcc cagagaacac agggactctt ctcttaggac ctttatgtac 9780 

agggcctcaa gataactgat gttagtcaga agactttcca ttctggccac agttcagctg 9840 

aggcaatcct ggaattttct ctccgctgca cagttccagt catcccagtt tgtacagttc 9900 

tggcactttt tgggtcaggc cgtgatccaa ggagcagaag ttccagctat ggtcagggag 9960 

tgcctgaccg tcccaactca ctgcactcaa acaaaggcga aaccacaaga gtggcttttg 10020 

ttgaaattgc agtgtggccc agaggggctg caccagtact ggattgacca cgaggcaaca 10080 

ttaatcctca gcaagtgcaa tttgcagcca ttaaattgaa ctaactgata ctacaatgca 10140 

atcagtatca acaagtggtt tggcttggaa gatggagtct aggggctcta caggagtagc 10200 

tactctctaa tggagttgca ttttgaagca ggacactgtg aaaagctggc ctcctaaaga 10260 

ggctgctaaa cattagggtc aattttccag tgcactttct gaagtgtctg cagttcccca 10320 

tgcaaagctg cccaaacata gcacttccaa ttgaatacaa ttatatgcag gcgtactgct 10380 

tcttgccagc actgtccttc tcaaatgaac tcaacaaaca atttcaaagt ctagtagaaa 10440 

gtaacaagct ttgaatgtca ttaaaaagta tatctgcttt cagtagttca gcttatttat 10500 

gcccactaga aacatcttgt acaagctgaa cactggggct ccagattagt ggtaaaacct 10560 

actttataca atcatagaat catagaatgg cctgggttgg aagggacccc aaggatcatg 10620 

aagatccaac acccccgcca caggcagggc caccaacctc cagatctggt actagaccag 10680 

gcagcccagg gctccatcca acctggccat gaacacctcc agggatggag catccacaac 10740 

ctc tctgggc a gcctgtgcc agcacctcac -caccctctct gtgaagaact tttccctgac 10800 

atccaatcta agccttccct ccttgaggtt agatccactc ccccttgtgc tatcactgtc 10860 

tactcttgta aaaagttgat tctcctcctt tttggaaggt tgcaatgagg tctccttgca 10920 

gccttcttct cttctgcagg atgaacaagc ccagctccct cagcctgtct ttataggaga 10980 

ggtgctccag ccctctgatc atctttgtgg ccctcctctg gacccgctcc aagagctcca 11040 

catctttcct gtactggggg ccccaggcct gaatgcagta ctccagatgg ggcctcaaaa 11100 

gagcagagta aagagggaca atcaccttcc tcaccctgct ggccagccct cttctgatgg 11160 

agccctggat acaactggct ttctgagctg caacttctcc ttatcagttc cactattaaa 11220 

acaggaacaa tacaacaggt gctgatggcc agtgcagagt ttttcacact tcttcatttc 11280 
ggtagatctt agatgaggaa cgttgaagtt gtgcttctgc gtgtgcttct tcctcctcaa 11340 
atactcctgc ctgatacctc accccacctg ccactgaatg gctccatggc cccctgcagc 11400 
cagggccctg atgaacccgg cactgcttca gatgctgttt aatagcacag tatgaccaag 11460 
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ttgcacctat gaatacacaa acaatgtgtt gcatccttca gcacttgaga agaagagcca 11520 

aatttgcatt gtcaggaaat ggtttagtaa ttctgccaat taaaacttgt ttatctacca 11580 

tggctgtttt tatggctgtt agtagtggta cactgatgat gaacaatggc tatgcagtaa 11640 

aatcaagact gtagatattg caacagacta taaaattcct ctgtggctta gccaatgtgg 11700 

tacttcccac attgtataag aaatttggca agtttagagc aatgtttgaa gtgttgggaa 11760 

atttctgtat actcaagagg gcgtttttga caactgtaga acagaggaat caaaaggggg 11820 

tgggaggaag ttaaaagaag aggcaggtgc aagagagctt gcagtcccgc tgtgtgtacg 11880 

acactggcaa catgaggtct ttgctaatct tggtgctttg cttcctgccc ctggctgcct 11940 

tagggtgcga tctgcctcag acccacagcc tgggcagcag gaggaccctg atgctgctgg 12000 

ctcagatgag gagaatcagc ctgtttagct gcctgaagga taggcacgat tttggctttc 12060 

ctcaagagga gtttggcaac cagtttcaga aggctgagac catccctgtg ctgcacgaga 12120 

tgatccagca gatctttaac ctgtttagca ccaaggatag cagcgctgct tgggatgaga 12180 

ccctgctgga taagttttac accgagctgt accagcagct gaacgatctg gaggcttgcg 12240 

tgatccaggg cgtgggcgtg accgagaccc ctctgatgaa ggaggatagc atcctggctg 12300 

tgaggaagta ctttcagagg atcaccctgt acctgaagga gaagaagtac agcccctgcg 12360 

cttgggaagt cgtgagggct gagatcatga ggagctttag cctgagcacc aacctgcaag 12420 

agagcttgag gtctaaggag taaaaagtct agagtcgggg cggccggccg cttcgagcag 12480 

acatgataag atacattgat gagtttggac aaaccacaac tagaatgcag tgaaaaaaat 12540 

gctttatttg tgaaatttgt gatgctattg ctttatttgt aaccattata agctgcaata 12600 

aacaagttaa caacaacaat tgcattcatt ttatgtttca ggttcagggg gaggtgtggg 12660 

aggtttttta aagcaagtaa aacctctaca aatgtggtaa aatcgataag gatccgtcga 12720 

gcggccgc 12728 

<210> 7 

<211> 11945 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> misc__feature 
<222> (1)..(237) 

<223> 5prime matrix attachment region (MAR) 
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<222> (261) . . (1564) 

<223> 5prime matrix attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (1565) (1912) 

<223> Sprime matrix attachment region (MAR) 



<220> 

<221> mis cofeature 

<222> (1930) . . (2012) 

<223> 5prime matrix attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (2013) . . (2671) 

<223> Intrinsically Curved DNA 



<220> 

<221> miscjfeature 

<222> (5848) . . (5934) 

<223> Transcription Enhancer 



<220> 

<221> misc_feature 

<222> (9160) . . (9325) 

<223> Transcription Enhancer 



<220> 

<221> misc_feature 

<222> (9326) . . (9626) 

<223> Negative Regulatory Element 



<220> 

<221> misc_feature 

<222> (9621) . . (9660) 

<223> Hormone Response Element 



<220> 

<221> misc_feature 

<222> (9680) . - (10060) 

<223> Hormone Response Element 



<220> 

<221> misc__feature 

<222> (10576) . - (10821) 

<223> Chicken CR1 Repeat 
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<220> 

<221> misc_feature 

<222> (10926) . . (11193) 

<223> Chicken CRl Repeat 

<220> 

<221> misc_feature 

<222> (11424) . . (11938) 

<223> Proximal promoter and lysozyme signal peptide 



<400> / 
tgccgccttc 


tttgatattc 


actctgttgt 


atttcatctc 


ttcttgccga 


tgaaaggata 


60 


taacagtctg 


tataacagtc 


tgtgaggaaa 


tacttggtat 


ttcttctgat 


cagtgttttt 


120 


ataagtaatg 


ttgaatattg 


gataaggctg 


tgtgtccttt 


gtcttgggag 


acaaagccca 


180 


cagcaggtgg 


tggttggggt 


ggtggcagct 


cagtgacagg 


agaggttttt 


ttgcctgttt 


240 


tttttttttt 


tttttttttt 


aagtaaggtg 


ttcttttttc 


ttagtaaatt 


ttctactgga 


300 


ctgtatgttt 


tgacaggtca 


gaaacatttc 


ttcaaaagaa 


gaaccttttg 


gaaactgtac 


360 


agcccttttc 


tttcattccc 


tttttgcttt 


ctgtgccaat 


gcctttggtt 


ctgattgcat 


420 


tatggaaaac 


gttgatcgga 


acttgaggtt 


tttatttata 


gtgtggcttg 


aaagcttgga 


480 


tagctgttgt 


tacacgagat 


accttattaa 


gtttaggcca 


gcttgatgct 


ttattttttc 


540 


cctttgaagt 


agtgagcgtt 


ctctggtttt 


tttcctttga 


aactggtgag 


gcttagattt 


600 


ttctaatggg 


attttttacc 


tgatgatcta 


gttgcatacc 


caaatgcttg 


taaatgtttt 


660 


cctagttaac 


atgttgataa 


cttcggattt 


acatgttgta 


tatacttgtc 


atctgtgttt 


720 


ctagtaaaaa 


tatatggcat 


ttatagaaat 


acgtaattcc 


tgatttcctt 


tttttttatc 


780 


tctatgctct 


gtgtgtacag 


gtcaaacaga 


cttcactcct 


atttttattt 


atagaatttt 


840 


atatgcagtc 


tgtcgttggt 


tcttgtgttg 


taaggataca 


gccttaaatt 


tcctagagcg 


900 


atgctcagta 


aggcgggttg 


tcacatgggt 


tcaaatgtaa 


aacgggcacg 


tttggctgct 


960 


gccttcccga 


gatccaggac 


actaaactgc 


ttctgcactg 


aggtataaat 


cgcttcagat 


1020 


cccagggaag 


tgcagatcca 


cgtgcatatt 


cttaaagaag 


aatgaatact 


ttctaaaata 


1080 


ttttggcata 


ggaagcaagc 


tgcatggatt 


tgtttgggac 


ttaaattatt 


ttggtaacgg 


1140 


agtgcatagg 


ttttaaacac 


agttgcagca 


tgctaacgag 


tcacagcgtt 


tatgcagaag 


1200 


tgatgcctgg 


atgcctgttg 


cagctgttta 


cggcactgcc 


ttgcagtgag 


cattgcagat 


1260 


aggggtgggg 


tgctttgtgt 


cgtgttccca 


cacgctgcca 


cacagccacc 


tcccggaaca 


1320 


catctcacct 


gctgggtact 


tttcaaacca 


tcttagcagt 


agtagatgag 


ttactatgaa 


1380 
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acagagaagt 


tcctcagttg 


gatattctca 


tgggatgtct 


tttttcccat 


gttgggcaaa 


1440 


gtatgataaa 


gcatctctat 


ttgtaaatta 


tgcacttgtt 


agttcctgaa 


tcctttctat 


1500 


agcaccactt 


attgcagcag 


gtgtaggctc 


tggtgtggcc 


tgtgtctgtg 


cttcaatctt 


1560 


ttaaagcttc 


tttggaaata 


cactgacttg 


attgaagtct 


cttgaagata 


gtaaacagta 


1620 


cttacctttg 


atcccaatga 


aatcgagcat 


ttcagttgta 


aaagaattcc 


gcctattcat 


1680 


accatgtaat 


gtaattttac 


acccccagtg 


ctgacacttt 


ggaatatatt 


caagtaatag 


1740 


actttggcct 


caccctcttg 


tgtactgtat 


tttgtaatag 


aaaatatttt 


aaactgtgca 


1800 


tatgattatt 


acattatgaa 


agagacattc 


tgctgatctt 


caaatgtaag 


aaaatgagga 


1860 


gtgcgtgtgc 


ttttataaat 


acaagtgatt 


gcaaattagt 


gcaggtgtcc 


ttaaaaaaaa 


1920 


aaaaaaaaag 


taatataaaa 


aggaccaggt 


gttttacaag 


tgaaatacat 


tcctatttgg 


1980 


taaacagtta 


catttttatg 


aagattacca 


gcgctgctga 


ctttctaaac 


ataaggctgt 


2040 


attgtcttcc 


tgtaccattg 


catttcctca 


ttcccaattt 


gcacaaggat 


gtctgggtaa 


2100 


actattcaag 


aaatggcttt 


gaaatacagc 


atgggagctt 


gtctgagttg 


gaatgcagag 


2160 


ttgcactgca 


aaatgtcagg 


aaatggatgt 


ctctcagaat 


gcccaactcc 


aaaggatttt 


2220 


atatgtgtat 


atagtaagca 


gtttcctgat 


tccagcaggc 


caaagagtct 


gctgaatgtt 


2280 


gtgttgccgg 


agacctgtat 


ttctcaacaa 


ggtaagatgg 


tatcctagca 


actgcggatt 


2340 


ttaatacatt 


ttcagcagaa 


gtacttagtt 


aatctctacc 


tttagggatc 


gtttcatcat 


2400 


ttttagatgt 


tatacttgaa 


atactgcata 


acttttagct 


ttcatgggtt 


cctttttttc 


2460 


agcctttagg 


agactgttaa 


gcaatttgct 


gtccaacttt 


tgtgttggtc 


ttaaaetgea 


2520 


atagtagttt 


accttgtatt 


gaagaaataa 


agaccatttt 


tatattaaaa 


aatacttttg 


2580 


tctgtcttca 


ttttgacttg 


tctgatatcc 


ttgcagtgcc 


cattatgtca 


gttctgtcag 


2640 


atattcagac 


atcaaaactt 


aacgtgagct 


cagtggagtt 


acagctgcgg 


ttttgatgct 


2700 


gttattattt 


ctgaaactag 


aaatgatgtt 


gtcttcatct 


gctcatcaaa 


cacttcatgc 


2760 


agagtgtaag 


gctagtgaga 


aatgcataca 


tttattgata 


cttttttaaa 


gtcaactttt 


2820 


tatcagattt 


ttttttcatt 


tggaaatata 


ttgttttcta 


gactgcatag 


cttctgaatc 


2880 


tgaaatgcag 


tctgattggc 


atgaagaagc 


acagcactct 


tcatcttact 


taaacttcat 


2940 


tttggaatga 


aggaagttaa 


gcaagggcac 


aggtccatga 


aatagagaca 


gtgcgctcag 


3000 


gagaaagtga 


acctggattt 


ctttggctag 


tgttctaaat 


ctgtagtgag 


gaaagtaaca 


3060 


cccgattcct 


tgaaagggct 


ccagctttaa 


tgcttccaaa 


ttgaaggtgg 


caggcaactt 


3120 
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3180 
3240 
3300 



3600 
3660 



ggccactggt tatttactgc attatgtctc agtttcgcag ctaacctggc ttctccacta 
ttgagcatgg actatagcct ggcttcagag gccaggtgaa ggttgggatg ggtggaagga 
gtgctgggct gtggctgggg ggactgtggg gactccaagc tgagcttggg gtgggcagca 

cagggaaaag tgtgggtaac tatttttaag tactgtgttg caaacgtctc atctgcaaat 3360 

acgtagggtg tgtactctcg aagattaaca gtgtgggttc agtaatatat ggatgaattc 3420 

acagtggaag cattcaaggg tagatcatct aacgacacca gatcatcaag ctatgattgg 3480 

aagcggtatc agaagagcga ggaaggtaag cagtcttcat atgttttccc tccacgtaaa 3540 
gcagtctggg aaagtagcac cccttgagca gagacaagga aataattcag gagcatgtgc 
taggagaact ttcttgctga attctacttg caagagcttt gatgcctggc ttctggtgcc 

ttctgcagca cctgcaaggc ccagagcctg tggtgagctg gagggaaaga ttctgctcaa 3720 

gtccaagctt cagcaggtca ttgtctttgc ttcttccccc agcactgtgc agcagagtgg 3780 

aactgatgtc gaagcctcct gtccactacc tgttgctgca ggcagactgc tctcagaaaa 3840 

agagagctaa ctctatgcca tagtctgaag gtaaaatggg ttttaaaaaa gaaaacacaa 3900 

aggcaaaacc ggctgcccca tgagaagaaa gcagtggtaa acatggtaga aaaggtgcag 3960 

aagcccccag gcagtgtgac aggcccctcc tgccacctag aggcgggaac aagcttccct 4020 

gcctagggct ctgcccgcga agtgcgtgtt tctttggtgg gttttgtttg gcgtttggtt 4080 

ttgagattta gacacaaggg aagcctgaaa ggaggtgttg ggcactattt tggtttgtaa 4140 

agcctgtact tcaaatatat attttgtgag ggagtgtagc gaattggcca atttaaaata 4200 

aagttgcaag agattgaagg ctgagtagtt gagagggtaa eacgtttaat gagatcttct 4260 

gaaactactg cttctaaaca cttgtttgag tggtgagacc ttggataggt gagtgctctt 4320 

gttacatgtc tgatgcactt gcttgtcctt ttccatccac atccatgcat tccacatcca 4380 

cgcatttgtc acttatccca tatctgtcat atctgacata cctgtctctt cgtcacttgg 4440 

tcagaagaaa cagatgtgat aatccccagc cgccccaagt ttgagaagat ggcagttgct 4500 

tctttccctt tttcctgcta agtaaggatt ttctcctggc tttgacacct cacgaaatag 4560 

tcttcctgcc ttacattctg ggcattattt caaatatctt tggagtgcgc tgctctcaag 4 620 

tttgtgtctt cctactctta gagtgaatgc tcttagagtg aaagagaagg aagagaagat 4 680 

gttggccgca gttctctgat gaacacacct ctgaataatg gccaaaggtg ggtgggtttc 4740 

tctgaggaac gggcagcgtt tgcctctgaa agcaaggagc tctgcggagt tgcagttatt 4800 

ttgcaactga tggtggaact ggtgcttaaa gcagattccc taggttccct gctacttctt 4860 
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ttccttcttg 


gcagtcagtt 


tatttctgac 


agacaaacag ccacccccac 


tgcaggctta 


4920 


gaaagtatgt 


ggctctgcct 


gggtgtgtta 


cagctctgcc ctggtgaaag 


gggattaaaa 


4980 


cgggcaccat 


tcatcccaaa 


caggatcctc 


attcatggat caagctgtaa 


ggaacttggg 


5040 


ctccaacctc 


aaaacattaa 


ttggagtacg 


aatgtaatta aaactgcatt 


ctcgcattcc 


5100 


taagtcattt 


agtctggact 


ctgcagcatg 


taggtcggca gctcccactt 


tctcaaagac 


5160 


cactgatgga 


ggagtagtaa 


aaatggagac 


cgattcagaa caaccaacgg 


agtgttgccg 


5220 


aagaaactga 


tggaaataat 


gcatgaattg 


tgtggtggac atttttttta 


aatacataaa 


5280 


ctacttcaaa 


tgaggtcgga 


gaaggtcagt 


gttttattag cagccataaa 


accaggtgag 


5340 


cgagtaccat 


ttttctctac 


aagaaaaacg 


attctgagct ctgcgtaagt 


ataagttctc 


5400 


catagcggct 


gaagctcccc 


cctggctgcc 


tgccatctca gctggagtgc 


agtgccattt 


5460 


ccttggggtt 


tctctcacag 


cagtaatggg 


acaatacttc acaaaaattc 


tttcttttcc 


5520 


tgtcatgtgg 


gatccctact 


gtgccctcct 


ggttttacgt taccccctga 


ctgttccatt 


5580 


cagcggtttg 


gaaagagaaa 


aagaatttgg 


aaataaaaca tgtctacgtt 


atcacctcct 


5640 


ccagcatttt 


ggtttttaat 


tatgtcaata 


actggcttag atttggaaat 


gagagggggt 


5700 


tgggtgtatt 


accgaggaac 


aaaggaaggc 


ttatataaac tcaagtcttt 


tatttagaga 


5760 


actggcaagc 


tgtcaaaaac 


aaaaaggcct 


taccaccaaa ttaagtgaat 


agccgctata 


5820 


gccagcaggg 


ccagcacgag 


ggatggtgca 


ctgctggcac tatgccacgg 


cctgcttgtg 


5880 


actctgagag 


caactgcttt 


ggaaatgaca 


gcacttggtg caatttcctt 


tgtttcagaa 


5940 


tgcgtagagc 


gtgtgcttgg 


cgacagtttt 


tctagttagg ccacttcttt 


tttccttctc 


6000 


tcctcattct 


cctaagcatg 


tctccatgct 


ggtaatccca gtcaagtgaa 


cgttcaaaca 


6060 


atgaatccat 


cactgtagga 


ttctcgtggt 


gatcaaatct ttgtgtgagg 


tctataaaat 


6120 


atggaagctt 


atttattttt 


cgttcttcca 


tatcagtctt ctctatgaca 


attcacatcc 


6180 


accacagcaa 


attaaaggtg 


aaggaggctg 


gtgggatgaa gagggtcttc 


tagctttacg 


6240 


ttcttccttg 


caaggccaca 


ggaaaatgct 


gagagctgta gaatacagcc 


tggggtaaga 


6300 


agttcagtct 


cctgctggga 


cagctaaccg 


catcttataa ccccttctga 


gactcatctt 


6360 


aggaccaaat 


agggtctatc 


tggggttttt 


gttcctgctg ttcctcctgg 


aaggctatct 


6420 


cactatttca 


ctgctcccac 


ggttacaaac 


caaagataca gcctgaattt 


tttctaggcc 


6480 


acattacata 


aatttgacct 


ggtaccaata 


ttgttctcta tatagttatt 


tccttcccca 


6540 


ctgtgtttaa 


ccccttaagg 


cattcagaac 


aactagaatc atagaatggt 


ttggattgga 


6600 
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aggggcctta 


aacatcatcc 


atttccaacc ctctgccatg 


ggctgcttgc 


cacccactgg 


6660 


ctcaggctgc 


ccagggcccc 


atccagcctg gccttgagca 


cctccaggga 


tggggcaccc 


6720 


acagcttctc 


tgggcagcct 


gtgccaacac ctcaccactc 


tctgggtaaa 


gaattctctt 


6780 


ttaacatcta 


atctaaatct 


cttctctttt agtttaaagc 


cattcctctt 


tttcccgttg 


6840 


ctatctgt cc 


aagaaatgtg 


tattggtctc cctcctgctt 


ataagcagga 


agtactggaa 


6900 


ggctgcagtg 


aggtctcccc 


acagccttct cttctccagg 


ctgaacaagc 


ccagctcctt 


6960 


cagcctgtct 


tcgtaggaga 


tcatcttagt ggccctcctc 


tggacccatt 


ccaacagttc 


7020 


cacggctttc 


ttgtggagcc 


ccaggtctgg atgcagtact 


tcagatgggg 


ccttacaaag 


7080 


gcagagcaga 


tggggacaat 


cgcttacccc tccctgctgg 


ctgcccctgt 


tttgatgcag 


7140 


cccagggtac 


tgttggcctt 


tcaggctccc agaccccttg 


ctgatttgtg 


tcaagctttt 


7200 


catccaccag 


aacccacgct 


tcctggttaa tacttctgcc 


ctcacttctg 


taagcttgtt 


7260 


tcaggagact 


tccattcttt 


aggacagact gtgttacacc 


tacctgccct 


attcttgcat 


7320 


atatacattt 


cagttcatgt 


ttcctgtaac aggacagaat 


atgtattcct 


ctaacaaaaa 


7380 


tacatgcaga 


attcctagtg 


ccatctcagt agggttttca 


tggcagtatt 


agcacatagt 


7440 


caatttgctg 


caagtacctt 


ccaagctgcg gcctcccata 


aatcctgtat 


ttgggatcag 


7500 


ttaccttttg 


gggtaagctt 


ttgtatctgc agagaccctg 


ggggttctga 


tgtgcttcag 


7560 


ctctgctctg 


ttctgactgc 


accattttct agatcaccca 


gttgttcctg 


tacaacttcc 


7620 


ttgtcctcca 


tcctttccca 


gcttgtatct ttgacaaata 


caggcctatt 


tttgtgtttg 


7680 


cttcagcagc 


catttaattc 


ttcagtgtca tcttgttctg 


ttgatgeeao 


tggaaeagga 


7740 


ttttcagcag 


tcttgcaaag 


aacatctagc tgaaaacttt 


ctgccattca 


atattcttac 


7800 


cagttcttct 


tgtttgaggt 


gagccataaa ttactagaac 


ttcgtcactg 


acaagtttat 


7860 


gcattttatt 


acttctatta 


tgtacttact ttgacataac 


acagacacgc 


acatattttg 


7920 


ctgggatttc 


cacagtgtct 


ctgtgtcctt cacatggttt 


tactgtcata 


cttccgttat 


7980 


aaccttggca 


atctgcccag 


ctgcccatca caagaaaaga 


gattcctttt 


ttattacttc 


8040 


tcttcagcca 


ataaacaaaa 


tgtgagaagc ccaaacaaga 


acttgtgggg 


caggctgcca 


8100 


tcaagggaga 


gacagctgaa 


gggttgtgta gctcaataga 


attaagaaat 


aataaagctg 


8160 


tgtcagacag 


ttttgcctga 


tttatacagg cacgccccaa 


gccagagagg 


ctgtctgcca 


8220 


aggccacctt 


gcagtccttg 


gtttgtaaga taagtcatag 


gtaacttttc 


tggtgaattg 


8280 


cgtggagaat 


catgatggca 


gttcttgctg tttactatgg 


taagatgcta 


aaataggaga 


8340 
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cagcaaagta acacttgctg 


ctgtaggtgc 


tctgctatcc 


agacagcgat 


ggcactcgca 


8400 


caccaagatg agggatgctc 


ccagctgacg 


gatgctgggg 


cagtaacagt 


gggtcccatg 


8460 


ctgcctgctc attagcatca 


cctcagccct 


caccagccca 


tcagaaggat 


catcccaagc 


8520 


tgaggaaagt tgctcatctt 


cttcacatca 


tcaaaccttt 


ggcctgactg 


atgcctcccg 


8580 


gatgcttaaa tgtggtcact 


gacatcttta 


tttttctatg 


atttcaagtc 


agaacctccg 


8640 


gatcaggagg gaacacatag 


tgggaatgta 


ccctcagctc 


caaggccaga 


tcttccttca 


8700 


atgatcatgc atgctactta 


ggaaggtgtg 


tgtgtgtgaa 


tgtagaattg 


cctttgttat 


8760 


tttttcttcc tgctgtcagg 


aacattttga 


ataccagaga 


aaaagaaaag 


tgctcttctt 


8820 


ggcatgggag gagttgtcac 


acttgcaaaa 


taaaggatgc 


agtcccaaat 


gttcataatc 


8880 


tcagggtctg aaggaggatc 


agaaactgtg 


tatacaattt 


caggcttctc 


tgaatgcagc 


8940 


ttttgaaagc tgttcctggc 


cgaggcagta 


ctagtcagaa 


ccctcggaaa 


caggaacaaa 


9000 


tgtcttcaag gtgcagcagg 


aggaaacacc 


ttgcccatca 


tgaaagtgaa 


taaccactgc 


9060 


cgctgaagga atccagctcc 


tgtttgagca 


ggtgctgcac 


actcccacac 


tgaaacaaca 


9120 


gttcattttt ataggacttc 


caggaaggat 


cttcttctta 


agcttcttaa 


ttatggtaca 


9180 


tctccagttg gcagatgact 


atgactactg 


acaggagaat 


gaggaactag 


ctgggaatat 


9240 


ttctgtttga ccaccatgga 


gtcacccatt 


tctttactgg 


tatttggaaa 


taataattct 


9300 


gaattgcaaa gcaggagtta 


gcgaagatct 


tcatttcttc 


catgttggtg 


acagcacagt 


9360 


tctggctatg aaagtctgct 


tacaaggaag 


aggataaaaa 


tcatagggat 


aataaatcta 


9420 


agtttgaaga caatgaggtt 


ttagctgcat 


ttgacatgaa 


gaaattgaga 


cctctactgg 


9480 


atagctatgg tatttacgtg 


tctttttgct 


tagttactta 


ttgaccccag 


ctgaggtcaa 


9540 


gtatgaactc aggtctctcg 


ggctactggc 


atggattgat 


tacatacaac 


tgtaatttta 


9600 


gcagtgattt agggtttatg 


agtacttttg 


cagtaaatca 


tagggttagt 


aatgttaatc 


9660 


tcagggaaaa aaaaaaaaag 


ccaaccctga 


cagacatccc 


agctcaggtg 


gaaatcaagg 


9720 


atcacagctc agtgcggtcc 


cagagaacac 


agggactctt 


ctcttaggac 


ctttatgtac 


9780 


agggcctcaa gataactgat 


gttagtcaga 


agactttcca 


ttctggccac 


agttcagctg 


9840 


aggcaatcct ggaattttct 


ctccgctgca 


cagttccagt 


catcccagtt 


tgtacagttc 


9900 


tggcactttt tgggtcaggc 


cgtgatccaa 


ggagcagaag 


ttccagctat 


ggtcagggag 


9960 


tgcctgaccg tcccaactca 


ctgcactcaa 


acaaaggcga 


aaccacaaga 


gtggcttttg 


10020 


ttgaaattgc agtgtggccc 


agaggggctg 


caccagtact 


ggattgacca 


cgaggcaaca 


10080 



-18- 



WO 03/024199 



PCT/US02/30156 



ttaatcctca 


gcaagtgcaa 


tttgcagcca 


ttaaattgaa 


ctaactgata 


ctacaatgca 


10140 


atcagtatca 


acaagtggtt 


tggcttggaa 


gatggagtct 


aggggctcta 


caggagtagc 


10200 


tactctctaa 


tggagttgca 


ttttgaagca 


ggacactgtg 


aaaagctggc 


ctcctaaaga 


10260 


ggctgctaaa 


cattagggtc 


aattttccag 


tgcactttct 


gaagtgtctg 


cagttcccca 


10320 


tgcaaagctg 


cccaaacata 


gcacttccaa 


ttgaatacaa 


ttatatgcag 


gcgtactgct 


10380 


tcttgccagc 


actgtccttc 


tcaaatgaac 


tcaacaaaca 


atttcaaagt 


ctagtagaaa 


10440 


gtaacaagct 


ttgaatgtca 


ttaaaaagta 


tatctgcttt 


cagtagttca 


gcttatttat 


10500 


gcccactaga 


aacatcttgt 


acaagctgaa 


cactggggct 


ccagattagt 


ggtaaaacct 


10560 


actttataca 


atcatagaat 


catagaatgg 


cctgggttgg 


aagggacccc 


aaggatcatg 


10620 


aagatccaac 


acccccgcca 


caggcagggc 


caccaacctc 


cagatctggt 


actagaccag 


10680 


gcagcccagg 


gctccatcca 


acctggccat 


gaacacctcc 


agggatggag 


catccacaac 


10740 


ctctctgggc 


agcctgtgcc 


agcacctcac 


caccctctct 


gtgaagaact 


tttccctgac 


10800 


atccaatcta 


agccttccct 


ccttgaggtt 


agatccactc 


ccccttgtgc 


tatcactgtc 


10860 


tactcttgta 


aaaagttgat 


tctcctcctt 


tttggaaggt 


tgcaatgagg 


tctccttgca 


10920 


gccttcttct 


cttctgcagg 


atgaacaagc 


ccagctccct 


cagcctgtct 


ttataggaga 


10980 


ggtgctccag 


ccctctgatc 


atctttgtgg 


ccctcctctg 


gacccgctcc 


aagagctcca 


11040 


catctttcct 


gtactggggg 


ccccaggcct 


gaatgcagta 


ctccagatgg 


ggcctcaaaa 


11100 


gagcagagta 


aagagggaca 


atcaccttcc 


tcaccctgct 


ggccagccct 


cttctgatgg 


11160 


agccctggat. 


acaactggct 


ttctgagctg 


caacttctcc 


ttatcagttc 


cactattaaa 


11220 


acaggaacaa 


tacaacaggt 


gctgatggcc 


agtgcagagt 


ttttcacact 


tcttcatttc 


11280 


ggtagatctt 


agatgaggaa 


cgttgaagtt 


gtgcttctgc 


gtgtgcttct 


tcctcctcaa 


11340 


atactcctgc 


ctgatacctc 


accccacctg 


ccactgaatg 


gctccatggc 


cccctgcagc 


11400 


cagggccctg 


atgaacccgg 


cactgcttca 


gatgctgttt 


aatagcacag 


tatgaccaag 


11460 


ttgcacctat 


gaatacacaa 


acaatgtgtt 


gcatccttca 


gcacttgaga 


agaagagcca 


11520 


aatttgcatt 


gtcaggaaat 


ggtttagtaa 


ttctgccaat 


taaaacttgt 


ttatctacca 


11580 


tggctgtttt 


tatggctgtt 


agtagtggta 


cactgatgat 


gaacaatggc 


tatgcagtaa 


11640 


aatcaagact 


gtagatattg 


caacagacta 


taaaattcct 


ctgtggctta 


gccaatgtgg 


11700 


tacttcccac 


attgtataag 


aaatttggca 


agtttagagc 


aatgtttgaa 


gtgttgggaa 


11760 


atttctgtat 


actcaagagg 


gcgtttttga 


caactgtaga 


acagaggaat 


caaaaggggg 


11820 
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tgggaggaag ttaaaagaag aggcaggtgc aagagagctt gcagtcccgc tgtgtgtacg 11880. 
acactggcaa catgaggtct ttgctaatct tggtgctttg cttcctgccc ctggctgcct 11940 



<210> 8 

<211> 285 

<212> DNA 

<213> SV4 0 

<220> 

<221> misc_feature 

<222> (1)..(285) 

<223> SV40 Polyadenylation Sequence 



<400> 8 

aaagtctaga gtcggggcgg ccggccgctt cgagcagaca tgataagata cattgatgag 60 

tttggacaaa ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat 120 

gctattgctt tatttgtaac cattataagc tgcaataaac aagttaacaa caacaattgc 180 

attcatttta tgtttcaggt tcagggggag gtgtgggagg ttttttaaag caagtaaaac 240 

ctctacaaat gtggtaaaat cgataaggat ccgtcgagcg gccgc 285 



<210> 9 

<211> 5972 

<212> DNA 

<213> Gallus gallus 
<220> 

<221> mis cofeature 

<222> (1)..(5972) 

<223> Lysozyme 3prime domain 



taggg 



11945 



<400> 9 

cgcgtggtag gtggcggggg gttcccagga gagcccccag cgcggacggc agcgccgtca 



60 



ctcaccgctc cgtctccctc cgcccagggt cgcctggcgc aaccgctgca agggcaccga 



120 



cgtccaggcg tggatcagag gctgccggct gtgaggagct gccgcgcccg gcccgcccgc 



180 



tgcacagccg gccgctttgc gagcgcgacg ctacccgctt ggcagtttta aacgcatccc 



240 



tcattaaaac gactatacgc aaacgccttc ccgtcggtcc gcgtctcttt ccgccgccag 



300 



ggcgacactc gcggggaggg cgggaagggg gccgggcggg agcccgcggc caaccgtcgc 



360 



cccgtgacgg caccgccccg cccccgtgac gcggtgcggg cgccggggcc gtggggctga 



420 



gcgctgcggc ggggccgggc cgggccgggg cgggagctga gcgcggcgcg gctgcgggcg 



480 
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gcgccccctc cggtgcaata tgttcaagag aatggctgag ttcgggcctg actccggggg 540 

cagggtgaag gtgcggcgcg ggcggaggga cggggcgggc gcggggccgc ccggcgggtg 600 

ccggggcctc tgccggcccg cccggctcgg gctgctgcgg cgcttacggg cgcgcttctc 660 

gccgctgccg cttctcttct ctcccgcgca agggcgtcac catcgtgaag ccggtagtgt 720 

acgggaacgt ggcgcggtac ttcgggaaga agagggagga ggacgggcac acgcatcagt 780 

ggacggttta cgtgaagccc tacaggaacg aggtagggcc cgagcgcgtc ggccgccgtt 840 

ctcggagcgc cggagccgtc agcgccgcgc ctgggtgcgc tgtgggacac agcgagcttc 900 

tctcgtagga catgtccgcc tacgtgaaaa aaatccagtt caagctgcac gagagctacg 960 

ggaatcctct ccgaggtggg tgttgcgtcg gggggtttgc tccgctcggt cccgctgagg 1020 

ctcgtcgccc tcatctttct ttcgtgccgc agtcgttacc aaaccgccgt acgagatcac 1080 

cgaaacgggc tggggcgaat ttgaaatcat catcaagata tttttcattg atccaaacga 1140 

gcgacccgta agtacgctca gcttctcgta gtgcttcccc cgtcctggcg gcccggggct 1200 

gggctgctcg ctgctgccgg tcacagtccc gccagccgcg gagctgactg agctcccttt 1260 

cccgggacgt gtgctctgtg. ttcggtcagc gaggctatcg ggagggcttt ggctgcattt 1320 

ggcttctctg gcgcttagcg caggagcacg ttgtgctacg cctgaactac agctgtgaga 1380 

aggccgtgga aaccgctctc aaactgattt attggcgaaa tggctctaaa ctaaatcgtc 1440 

tcctctcttt ggaaatgctt tagagaaggt ctctgtggta gttcttatgc atctatccta 1500 

aagcacttgg ccagacaatt taaagacatc aagcagcatt tatagcaggc acgtttaata 1560 

acgaatactg a.atttaagta actct.gct.ca cgttgtatga cgtttatttt cgtattcctg 1620 

aaagccatta aaatcctgtg cagttgttta gtaagaacag ctgccactgt tttgtatcta 1680 

ggagataact ggtgtttccc tacagttctc aagctgataa aactctgtct ttgtatctag 1740 

gtaaccctgt atcacttgct gaagcttttt cagtctgaca ccaatgcaat cctgggaaag 1800 

aaaactgtag tttctgaatt ctatgatgaa atggtatgaa aattttaatg tcaaccgagc 1860 

ctgactttat ttaaaaaaaa ttattgatgg tgctgtgtat tttggtcctt ccttagatat 1920 

ttcaagatcc tactgccatg atgcagcaac tgctaacgac gtcccgtcag ctgacacttg 1980 

gtgcttacaa gcatgaaaca gagtgtaagt gcaaaatgag gataccttcg ccgaccgtca 2040 

ttcactacta atgttttctg tgggatgtga tcgtacagtg agtttggctg tgtgaaattt 2100 

gaatagcttg gtattggcag tgatgacgtg atcgatgcct tgcttatcat gtttgaaatg 2160 

aagtagaata aatgcagcct gctttatttg agatagtttg gttcatttta tggaatgcaa 2220 
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gcaaagatta 


tacttcctca 


ctgaattgca 


ctgtccaaag 


gtgtgaaatg 


tgtggggatc 


2280 


tggaggaccg 


tgaccgaggg 


acattggatc 


gctatctccc 


atttcttttg 


ctgttaccag 


2340 


ttcagatttt 


cttttcacct 


agtctttaat 


tcccagggtt 


ttgttttttc 


cttggtcata 


2400 


gtttttgttt 


ttcactctgg 


caaatgatgt 


tgtgaattac 


actgcttcag 


ccacaaaact 


2460 


gatggactga 


atgaggtcat 


caaacaaact 


tttcttcttc 


cgtatttcct 


tttttttccc 


2520 


ccacttatca 


tttttactgc 


tgttgttgag 


tctgtaaggc 


taaaagtaac 


tgttttgtgc 


2580 


tttttcagga 


cgtgtgcttt 


ccaaattact 


gccacatata 


taaagaaagg 


ttggaatttt 


2640 


aaagataatt 


catgtttctt 


cttctttttt 


gccaccacag 


ttgcagatct 


tgaagtaaaa 


2700 


accagggaaa 


agctggaagc 


tgccaaaaag 


aaaaccagtt 


ttgaaattgc 


tgagcttaaa 


2760 


gaaaggttaa 


aagcaagtcg 


tgaaaccatc 


aactgcttaa 


agagtgaaat 


cagaaaactc 


2820 


gaagaggatg 


atcagtctaa 


agatatgtga 


tgagtgttga 


cttggcaggg 


agcctataat 


2880 


gagaatgaaa 


ggacttcagt 


cgtggagttg 


tatgcgttct 


ctccaattct 


gtaacggaga 


2940 


ctgtatgaat 


ttcatttgca 


aatcactgca 


gtgtgtgaca 


actgactttt 


tataaatggc 


3000 


agaaaacaag 


aatgaatgta 


tcctcatttt 


atagttaaaa 


tctatgggta 


tgtactggtt 


3060 


tatttcaagg 


agaatggatc 


gtagagactt 


ggaggccaga 


ttgctgcttg 


tattgactgc 


3120 


atttgagtgg 


tgtaggaaca 


ttttgtctat 


ggtcccgtgt 


tagtttacag 


aatgccactg 


3180 


ttcactgttt 


tgttttgtat 


tttacttttt 


ctactgcaac 


gtcaaggttt 


taaaagttga 


3240 


aaataaaaca 


tgcaggtttt 


ttttaaatat 


ttttttgtct 


ctatccagtt 


tgggcttcaa 


3300 


gtattattgt 


taacagcaag 


tcetgattta 


agtcagaggc 


tgaagtgtaa 


tggtattcaa 


3360 


gatgcttaag 


tctgttgtca 


gcaaaacaaa 


agagaaaact 


tcataaaatc 


aggaagttgg 


3420 


catttctaat 


aacttcttta 


tcaacagata 


agagtttcta 


gccctgcatc 


tactttcact 


3480 


tatgtagttg 


atgcctttat 


attttgtgtg 


tttggatgca 


ggaagtgatt 


cctactctgt 


3540 


tatgtagata 


ttctatttaa 


cacttgtact 


ctgctgtgct 


tagcctttcc 


ccatgaaaat 


3600 


tcagcggctg 


taaatccccc 


tcttcttttg 


tagcctcata 


cagatggcag 


accctcaggc 


3660 


ttataaaggc 


ttgggcatct 


tctttactgc 


tttgagattc 


tgtgttgcag 


taacctctgc 


3720 


cagagaggag 


aaaagcccca 


caaacctcat 


ccccttcttc 


tatagcaatc 


agtattacta 


3780 


atgctttgag 


aacagagcac 


tggtttgaaa 


cgtttgataa 


ttagcattta 


acatggcttg 


3840 


gtaaagatgc 


agaactgaaa 


cagctgtgac 


agtatgaact 


cagtatggag 


acttcattaa 


3900 


gacaaacagc 


tgttaaaatc 


aggcatgttt 


cattgaggag 


gacggggcaa 


cttgcaccag 


3960 
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tggtgcccac 


acaaatcctt 


cctggcgctg 


cagaccaatt 


tttctggcat 


tctgactgcc 


4020 


gttgctgctg 


gtcacagaga 


gcaactattt 


ttatcagcca 


caggcaattt 


gcttgtagta 


4080 


ttttccaagt 


gttgtaggta 


agtataaatg 


catcggctcc 


agagcacttt 


gagtatactt 


4140 


attaaaaaca 


taaatgaaag 


acaaattagc 


tttgcttggg 


tgcacagaac 


atttttagtt 


4200 


ccagcctgct 


ttttggtaga 


agccctcttc 


tgaggctaga 


actgactttg 


acaagtagag 


4260 


aaactggcaa 


cggagctatt 


gctatcgaag 


gatccttgtt 


aacaaagtta 


atcgtctttt 


4320 


aaggtttggt 


ttattcatta 


aatttgcttt 


taagctgtag 


ctgaaaaaga 


acgtgctgtc 


4380 


ttccatgcac 


caggtggcag 


ctctgtgcaa 


agtgctctct 


ggtctcacca 


gccttttaat 


4440 


tgccgggatt 


ctggcacgtc 


tgagagggct 


cagactggct 


tcgtttgttt 


gaacagcgtg 


4500 


tactgctttc 


tgtagacatg 


gccggtttct 


ctcctgcagc 


ttatgaaact 


gttcacactg 


4560 


aacacactgg 


aacaggttgc 


ccaaggaggc 


cgtggatgcc 


ccatccctgg 


aggcattcaa 


4620 


ggccaggctg 


gatgtggctc 


tgggcagcct 


ggtctggtgg 


ttggcgatcc 


tgcacatagc 


4680 


agcggggttg 


aaactcgatg 


atcactgtgg 


tccttttcaa 


cccaggctat 


tctatgattc 


4740 


tatgattcaa 


cagcaaatca 


tatgtactga 


gagaggaaac 


aaacacaagt 


gctactgttt 


4800 


gcaagttttg 


ttcatttggt 


aaaagagtca 


ggttttaaaa 


ttcaaaatct 


gtctggtttt 


4860 


ggtgtttttt 


tttttttatt 


tattatttct 


ttggggttct 


ttttgatgct 


ttatctttct 


4920 


ctgccaggac 


tgtgtgacaa 


tgggaacgaa 


aaagaacatg 


ccaggcactg 


tcctggattg 


4980 


cacacgctgg 


ttgcactcag 


tagcaggctc 


agaactgcca 


gtctttccac 


agtattactt 


5040 


tctaaaccta 


attttaatag 


cgttagtaga 


cttccatcac 


tgggcagtgc 


ttagtgaatg 


5100 


ctctgtgtga 


acgttttact 


tataagcatg 


ttggaagttt 


tgatgttcct 


ggatgcagta 


5160 


gggaaggaca 


gattagctat 


gtgaaaagta 


gattctgagt 


atcggggtta 


caaaaagtat 


5220 


agaaacgatg 


agaaattctt 


gttgtaacta 


attggaattt 


ctttaagcgt 


tcacttatgc 


5280 


tacattcata 


gtatttccat 


ttaaaagtag 


gaaaaggtaa 


aacgtgaaat 


cgtgtgattt 


5340 


tcggatggaa 


caccgccttc 


ctatgcacct 


gaccaacttc 


cagaggaaaa 


gcctattgaa 


5400 


agccgagatt 


aagccaccaa 


aagaactcat 


ttgcattgga 


atatgtagta 


tttgccctct 


5460 


tcctcccggg 


taattactat 


actttatagg 


gtgcttatat 


gttaaatgag 


tggctggcac 


5520 


tttttattct 


cacagctgtg 


gggaattctg 


tcctctagga 


cagaaacaat 


tttaatctgt 


5580 


tccactggtg 


actgctttgt 


cagcacttcc 


acctgaagag 


atcaatacac 


tcttcaatgt 


5640 


ctagttctgc 


aacacttggc 


aaacctcaca 


tcttatttca 


tactctcttc 


atgcctatgc 


5700 
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ttattaaagc aataatctgg gtaatttttg ttttaatcac tgtcctgacc ccagtgatga 5760 

ccgtgtccca cctaaagctc aattcaggtc ctgaatctct tcaactctct atagctaaca 5820 

tgaagaatct tcaaaagtta ggtctgaggg acttaaggct aactgtagat gttgttgcct 5880 

ggtttctgtg ctgaaggccg tgtagtagtt agagcattca acctctagaa gaagcttggc 5940 

cagctggtcg acctgcagat ccggccctcg ag 5972 



<210> 10 

<211> 18391 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> misc_f eature 

<222> (1)..(237) 

<223> 5prime matrix (scaffold) attachment region (MAR) 



<220> 

<221> misc_f eature 

<222> (261) . . (1564) 

<223> Sprime matrix (scaffold) attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (1565) . . (1912) 

<223> 5prime matrix (scaffold) attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (1930) (2012) 

<223> Sprime matrix (scaffold) attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (2013) . . (2671) 

<223> Intrinsically curved DNA 



<220> 

<221> misc_feature 

<222> (5848) . . (5934) 

<223> Transcription enhancer 



<220> 

<221> misc_feature 

<222> (9160) . . (9325) 

<223> Transcription enhancer 
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<220> 
<221> 
<222> 
<223> 



mis cofeature 
(9326) . . (9626) 

Negative regulatory element 



<220> 
<221> 
<222> 
<223> 



mis cofeature 

(9621) . . (9660) 

Hormone response element 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(9680) . . (10060) 
Hormone response element 



<220> 
<221> 
<222> 
<223> 



misc_feature 

(10576) . . (10821) 

Chicken CR1 Repeat Sequence 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(10926) . . (11193) 
Chicken CR1 Repeat Sequence 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(11424) . . (11938) 

Lysozyme Proximal Promoter and Lysozyme Signal Peptide 



<220> 
<221> 
<222> 
<223> 



misc__f eature 
(11946) . . (12443) 

human interferon alpha 2b codon-optimized for expression in chick 
ens 



<220> 

<221> mis cofeature 

<222> (12464) . . (18391) 

<223> Chicken Lysozyme 3prime domain 



<400> 10 

tgccgccttc tttgatattc actctgttgt atttcatctc ttcttgccga tgaaaggata 60 

taacagtctg tataacagtc tgtgaggaaa tacttggtat ttcttctgat cagtgttttt 120 

ataagtaatg ttgaatattg gataaggctg tgtgtccttt gtcttgggag acaaagccca 180 

cagcaggtgg tggttggggt ggtggcagct cagtgacagg agaggttttt ttgcctgttt 240 
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tttttttttt tttttttttt aagtaaggtg ttcttttttc ttagtaaatt ttctactgga 300 

ctgtatgttt tgacaggtca gaaacatttc ttcaaaagaa gaaccttttg gaaactgtac 360 

agcccttttc tttcattccc tttttgcttt ctgtgccaat gcctttggtt ctgattgcat 420 

tatggaaaac gttgatcgga acttgaggtt tttatttata gtgtggcttg aaagcttgga 480 

tagctgttgt tacacgagat accttattaa gtttaggcca gcttgatgct ttattttttc 540 

cctttgaagt agtgagcgtt ctctggtttt tttcctttga aactggtgag gcttagattt 600 

ttctaatggg attttttacc tgatgatcta gttgcatacc caaatgcttg taaatgtttt 660 

cctagttaac atgttgataa cttcggattt acatgttgta tatacttgtc atctgtgttt 720 

ctagtaaaaa tatatggcat ttatagaaat acgtaattcc tgatttcctt tttttttatc 780 

tctatgctct gtgtgtacag gtcaaacaga cttcactcct atttttattt atagaatttt 840 

atatgcagtc tgtcgttggt tcttgtgttg taaggataca gccttaaatt tcctagagcg 900 

atgctcagta aggcgggttg tcacatgggt tcaaatgtaa aacgggcacg tttggctgct 960 

gccttcccga gatccaggac actaaactgc ttctgcactg aggtataaat cgcttcagat 1020 

cccagggaag tgcagatcca cgtgcatatt cttaaagaag aatgaatact ttctaaaata 1080 

ttttggcata ggaagcaagc tgcatggatt tgtttgggac ttaaattatt ttggtaacgg 1140 

agtgcatagg ttttaaacac agttgcagca tgctaacgag tcacagcgtt tatgcagaag 1200 

tgatgcctgg atgcctgttg cagctgttta cggcactgcc ttgcagtgag cattgcagat 1260 

aggggtgggg tgctttgtgt cgtgttccca cacgctgcca cacagccacc tcccggaaca 1320 

catctcacct gctgggtact tttcaaacca tcttagcagt agtagatgag ttactatgaa 1380 

acagagaagt tcctcagttg gatattctca tgggatgtct tttttcccat gttgggcaaa 1440 

gtatgataaa gcatctctat ttgtaaatta tgcacttgtt agttcctgaa tcctttctat 1500 

agcaccactt attgcagcag gtgtaggctc tggtgtggcc tgtgtctgtg cttcaatctt 1560 

ttaaagcttc tttggaaata cactgacttg attgaagtct cttgaagata gtaaacagta 1620 

cttacctttg atcccaatga aatcgagcat ttcagttgta aaagaattcc gcctattcat 1680 

accatgtaat gtaattttac acccccagtg ctgacacttt ggaatatatt caagtaatag 1740 

actttggcct caccctcttg tgtactgtat tttgtaatag aaaatatttt aaactgtgca 1800 

tatgattatt acattatgaa agagacattc tgctgatctt caaatgtaag aaaatgagga 1860 

gtgcgtgtgc ttttataaat acaagtgatt gcaaattagt gcaggtgtcc ttaaaaaaaa 1920 

aaaaaaaaag taatataaaa aggaccaggt gttttacaag tgaaatacat tcctatttgg 1980 
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taaacagtta catttttatg aagattacca gcgctgctga ctttctaaac ataaggctgt 2040 

attgtcttcc tgtaccattg catttcctca ttcccaattt gcacaaggat gtctgggtaa 2100 

actattcaag aaatggcttt gaaatacagc atgggagctt gtctgagttg gaatgcagag 2160 

ttgcactgca aaatgtcagg aaatggatgt ctctcagaat gcccaactcc aaaggatttt 2220 

atatgtgtat atagtaagca gtttcctgat tccagcaggc caaagagtct gctgaatgtt 2280 

gtgttgccgg agacctgtat ttctcaacaa ggtaagatgg tatcctagca actgcggatt 2340 

ttaatacatt ttcagcagaa gtacttagtt aatctctacc tttagggatc gtttcatcat 2400 

ttttagatgt tatacttgaa atactgcata acttttagct ttcatgggtt cctttttttc 24 60 

agcctttagg agactgttaa gcaatttgct gtccaacttt tgtgttggtc ttaaactgca 2520 

atagtagttt accttgtatt gaagaaataa agaccatttt tatattaaaa aatacttttg 2580 

tctgtcttca ttttgacttg tctgatatcc ttgcagtgcc cattatgtca gttctgtcag 2640 

atattcagac atcaaaactt aacgtgagct cagtggagtt acagctgcgg ttttgatgct 2700 

gttattattt ctgaaactag aaatgatgtt gtcttcatct gctcatcaaa cacttcatgc 2760 

agagtgtaag gctagtgaga aatgcataca tttattgata cttttttaaa gtcaactttt 2820 

tatcagattt ttttttcatt tggaaatata ttgttttcta gactgcatag cttctgaatc 2880 

tgaaatgcag tctgattggc atgaagaagc acagcactct tcatcttact taaacttcat 2940 

tttggaatga aggaagttaa gcaagggcac aggtccatga aatagagaca gtgcgctcag 3000 

gagaaagtga acctggattt ctttggctag tgttctaaat ctgtagtgag gaaagtaaca 3060 

cccgattcct tgaaagggct ccagctttaa tgcttccaaa.ttgaaggt.gg caggcaactt 3120 

ggccactggt tatttactgc attatgtctc agtttcgcag ctaacctggc ttctccacta 3180 

ttgagcatgg actatagcct ggcttcagag gccaggtgaa ggttgggatg ggtggaagga 3240 

gtgctgggct gtggctgggg ggactgtggg gactccaagc tgagcttggg gtgggcagca 3300 

cagggaaaag tgtgggtaac tatttttaag tactgtgttg caaacgtctc atctgcaaat 3360 

acgtagggtg tgtactctcg aagattaaca gtgtgggttc agtaatatat ggatgaattc 3420 

acagtggaag cattcaaggg tagatcatct aacgacacca gatcatcaag ctatgattgg 3480 

aagcggtatc agaagagcga ggaaggtaag cagtcttcat atgttttccc tccacgtaaa 3540 

gcagtctggg aaagtagcac cccttgagca gagacaagga aataattcag gagcatgtgc 3600 

taggagaact ttcttgctga attctacttg caagagcttt gatgcctggc ttctggtgcc 3660 

ttctgcagca cctgcaaggc ccagagcctg tggtgagctg gagggaaaga ttctgctcaa 3720 
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gtccaagctt cagcaggtca ttgtctttgc ttcttccccc agcactgtgc agcagagtgg 3780 

aactgatgtc gaagcctcct gtccactacc tgttgctgca ggcagactgc tctcagaaaa 3840 

agagagctaa ctctatgcca tagtctgaag gtaaaatggg ttttaaaaaa gaaaacacaa 3900 

aggcaaaacc ggctgcccca tgagaagaaa gcagtggtaa acatggtaga aaaggtgcag 3960 

aagcccccag gcagtgtgac aggcccctcc tgccacctag aggcgggaac aagcttccct 4020 

gcctagggct ctgcccgcga agtgcgtgtt tctttggtgg gttttgtttg gcgtttggtt 4080 

ttgagattta gacacaaggg aagcctgaaa ggaggtgttg ggcactattt tggtttgtaa 4140 

agcctgtact tcaaatatat attttgtgag ggagtgtagc gaattggcca atttaaaata 4200 

aagttgcaag agattgaagg ctgagtagtt gagagggtaa cacgtttaat gagatcttct 4260 

gaaactactg cttctaaaca cttgtttgag tggtgagacc ttggataggt gagtgctctt 4320 

gttacatgtc tgatgcactt gcttgtcctt ttccatccac atccatgcat tccacatcca 4380 

cgcatttgtc acttatccca tatctgtcat atctgacata cctgtctctt cgtcacttgg 4440 

tcagaagaaa cagatgtgat aatccccagc cgccccaagt ttgagaagat ggcagttgct 4500 

tctttccctt tttcctgcta agtaaggatt ttctcctggc tttgacacct cacgaaatag 4560 

tcttcctgcc ttacattctg ggcattattt caaatatctt tggagtgcgc tgctctcaag 4620 

tttgtgtctt cctactctta gagtgaatgc tcttagagtg aaagagaagg aagagaagat 4 680 

gttggccgca gttctctgat gaacacacct ctgaataatg gccaaaggtg ggtgggtttc 4740 

tctgaggaac gggcagcgtt tgcctctgaa agcaaggagc tctgcggagt tgcagttatt 4800 

ttgcaactga tggtggaact ggtgcttaaa gcagattccc taggttccct gctacttctt 4860 

ttccttcttg gcagtcagtt tatttctgac agacaaacag ccacccccac tgcaggctta 4 920 

gaaagtatgt ggctctgcct gggtgtgtta cagctctgcc ctggtgaaag gggattaaaa 4980 

cgggcaccat tcatcccaaa caggatcctc attcatggat caagctgtaa ggaacttggg 5040 

ctccaacctc aaaacattaa ttggagtacg aatgtaatta aaactgcatt ctcgcattcc 5100 

taagtcattt agtctggact ctgcagcatg taggtcggca gctcccactt tctcaaagac 5160 

cactgatgga ggagtagtaa aaatggagac cgattcagaa caaccaacgg agtgttgccg 5220 

aagaaactga tggaaataat gcatgaattg tgtggtggac atttttttta aatacataaa 5280 

ctacttcaaa tgaggtcgga gaaggtcagt gttttattag cagccataaa accaggtgag 5340 

cgagtaccat ttttctctac aagaaaaacg attctgagct ctgcgtaagt ataagttctc 5400 

catagcggct gaagctcccc cctggctgcc tgccatctca gctggagtgc agtgccattt 54 60 
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ccttggggtt tctctcacag cagtaatggg acaatacttc acaaaaattc tttcttttcc 5520 

tgtcatgtgg gatccctact gtgccctcct ggttttacgt taccccctga ctgttccatt 5580 

cagcggtttg gaaagagaaa aagaatttgg aaataaaaca tgtctacgtt atcacctcct 5640 

ccagcatttt ggtttttaat tatgtcaata actggcttag atttggaaat gagagggggt 5700 

tgggtgtatt accgaggaac aaaggaaggc ttatataaac tcaagtcttt tatttagaga 57 60 

actggcaagc tgtcaaaaac aaaaaggcct taccaccaaa ttaagtgaat agccgctata 5820 

gccagcaggg ccagcacgag ggatggtgca ctgctggcac tatgccacgg cctgcttgtg 5880 

actctgagag caactgcttt ggaaatgaca gcacttggtg caatttcctt tgtttcagaa 5940 

tgcgtagagc gtgtgcttgg cgacagtttt tctagttagg ccacttcttt tttccttctc 6000 

tcctcattct cctaagcatg tctccatgct ggtaatccca gtcaagtgaa cgttcaaaca 6060 

atgaatccat cactgtagga ttctcgtggt gatcaaatct ttgtgtgagg tctataaaat 6120 

atggaagctt atttattttt cgttcttcca tatcagtctt ctctatgaca attcacatcc 6180 

accacagcaa attaaaggtg aaggaggctg gtgggatgaa gagggtcttc tagctttacg 624 0 

ttcttccttg caaggccaca ggaaaatgct gagagctgta gaatacagcc tggggtaaga 6300 

agttcagtct cctgctggga cagctaaccg catcttataa ccccttctga gactcatctt 6360 

aggaccaaat agggtctatc tggggttttt gttcctgctg ttcctcctgg aaggctatct 6420 

cactatttca ctgctcccac ggttacaaac caaagataca gcctgaattt tttctaggcc 64 80 

acattacata aatttgacct ggtaccaata ttgttctcta tatagttatt tccttcccca 6540 

ctgtgtttaa ccccttaagg cattcagaac aactagaatc atagaatggt ttggattgga €600 

aggggcctta aacatcatcc atttccaacc ctctgccatg ggctgcttgc cacccactgg 6660 

ctcaggctgc ccagggcccc atccagcctg gccttgagca cctccaggga tggggcaccc 6720 

acagcttctc tgggcagcct gtgccaacac ctcaccactc tctgggtaaa gaattctctt 6780 

ttaacatcta atctaaatct cttctctttt agtttaaagc cattcctctt tttcccgttg 6840 

ctatctgtcc aagaaatgtg tattggtctc cctcctgctt ataagcagga agtactggaa 6900 

ggctgcagtg aggtctcccc acagccttct cttctccagg ctgaacaagc ccagctcctt 6960 

cagcctgtct tcgtaggaga tcatcttagt ggccctcctc tggacccatt ccaacagttc 7020 

cacggctttc ttgtggagcc ccaggtctgg atgcagtact tcagatgggg ccttacaaag 7080 

gcagagcaga tggggacaat cgcttacccc tccctgctgg ctgcccctgt tttgatgcag 7140 

cccagggtac tgttggcctt tcaggctccc agaccccttg ctgatttgtg tcaagctttt 7200 
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catccaccag 


aacccacgct 


tcctggttaa 


tacttctgcc 


ctcacttctg 


taagcttgtt 


7260 


tcaggagact 


tccattcttt 


aggacagact 


gtgttacacc 


tacctgccct 


attcttgcat 


7320 


atatacattt 


cagttcatgt 


ttcctgtaac 


aggacagaat 


atgtattcct 


ctaacaaaaa 


7380 


tacatgcaga 


attcctagtg 


ccatctcagt 


agggttttca 


tggcagtatt 


agcacatagt 


7440 


caatttgctg 


caagtacctt 


ccaagctgcg 


gcctcccata 


aatcctgtat 


ttgggatcag 


7500 


ttaccttttg 


gggtaagctt 


ttgtatctgc 


agagaccctg 


ggggttctga 


tgtgcttcag 


7560 


ctctgctctg 


ttctgactgc 


accattttct 


agatcaccca 


gttgttcctg 


tacaacttcc 


7620 


ttgtcctcca 


tcctttccca 


gcttgtatct 


ttgacaaata 


caggcctatt 


tttgtgtttg 


7680 


cttcagcagc 


catttaattc 


ttcagtgtca 


tcttgttctg 


ttgatgccac 


tggaacagga 


7740 


ttttcagcag 


tcttgcaaag 


aacatctagc 


tgaaaacttt 


ctgccattca 


atattcttac 


7800 


cagttcttct 


tgtttgaggt 


gagccataaa 


ttactagaac 


ttcgtcactg 


acaagtttat 


7860 


gcattttatt 


acttctatta 


tgtacttact 


ttgacataac 


acagacacgc 


acatattttg 


7920 


ctgggatttc 


cacagtgtct 


ctgtgtcctt 


cacatggttt 


tactgtcata 


cttccgttat 


7980 


aaccttggca 


atctgcccag 


ctgcccatca 


caagaaaaga 


gattcctttt 


ttattacttc 


8040 


tcttcagcca 


ataaacaaaa 


tgtgagaagc 


ccaaacaaga 


acttgtgggg 


caggctgcca 


8100 


tcaagggaga 


gacagctgaa 


gggttgtgta 


gctcaataga 


attaagaaat 


aataaagctg 


8160 


tgtcagacag 


ttttgcctga 


tttatacagg 


cacgccccaa 


gccagagagg 


ctgtctgcca 


8220 


aggccacctt 


gcagtccttg 


gtttgtaaga 


taagtcatag 


gtaacttttc 


tggtgaattg 


8280 


cgtggagaat 


catgatggca 


gttcttgctg 


tttactatgg 


taagatgcta 


aaataggaga 


8340 


cagcaaagta 


acacttgctg 


ctgtaggtgc 


tctgctatcc 


agacagcgat 


ggcactcgca 


8400 


caccaagatg 


agggatgctc 


ccagctgacg 


gatgctgggg 


cagtaacagt 


gggtcccatg 


8460 


ctgcctgctc 


attagcatca 


cctcagccct 


caccagccca 


tcagaaggat 


catcccaagc 


8520 


tgaggaaagt 


tgctcatctt 


cttcacatca 


tcaaaccttt 


ggcctgactg 


atgcctcccg 


8580 


gatgcttaaa 


tgtggtcact 


gacatcttta 


tttttctatg 


atttcaagtc 


agaacctccg 


8640 


gatcaggagg 


gaacacatag 


tgggaatgta 


ccctcagctc 


caaggccaga 


tcttccttca 


8700 


atgatcatgc 


atgctactta 


ggaaggtgtg 


tgtgtgtgaa 


tgtagaattg 


cctttgttat 


8760 


tttttcttcc 


tgctgtcagg 


aacattttga 


ataccagaga 


aaaagaaaag 


tgctcttctt 


8820 


ggcatgggag 


gagttgtcac 


acttgcaaaa 


taaaggatgc 


agtcccaaat 


gttcataatc 


8880 


tcagggtctg 


aaggaggatc 


agaaactgtg 


tatacaattt 


caggcttctc 


tgaatgcagc 


8940 
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ttttgaaagc 


tgttcctggc 


cgaggcagta 


ctagtcagaa 


ccctcggaaa 


caggaacaaa 


9000 


tgtcttcaag 


gtgcagcagg 


aggaaacacc 


ttgcccatca 


tgaaagtgaa 


taaccactgc 


9060 


cgctgaagga 


atccagctcc 


tgtttgagca 


ggtgctgcac 


actcccacac 


tgaaacaaca 


9120 


gttcattttt 


ataggacttc 


caggaaggat 


cttcttctta 


agcttcttaa 


ttatggtaca 


9180 


tctccagttg 


gcagatgact 


atgactactg 


acaggagaat 


gaggaactag 


ctgggaatat 


9240 


ttctgtttga 


ccaccatgga 


gtcacccatt 


tctttactgg 


tatttggaaa 


taataattct 


9300 


gaattgcaaa 


gcaggagtta 


gcgaagatct 


tcatttcttc 


catgttggtg 


acagcacagt 


9360 


tctggctatg 


aaagtctgct 


tacaaggaag 


aggataaaaa 


tcatagggat 


aataaatcta 


9420 


agtttgaaga 


caatgaggtt 


ttagctgcat 


ttgacatgaa 


gaaattgaga 


cctctactgg 


9480 


atagctatgg 


tatttacgtg 


tctttttgct 


tagttactta 


ttgaccccag 


ctgaggtcaa 


9540 


gtatgaactc 


aggtctctcg 


ggctactggc 


atggattgat 


tacatacaac 


tgtaatttta 


9600 


gcagtgattt 


agggtttatg 


agtacttttg 


cagtaaatca 


tagggttagt 


aatgttaatc 


9660 


tcagggaaaa 


aaaaaaaaag 


ccaaccctga 


cagacatccc 


agctcaggtg 


gaaatcaagg 


9720 


atcacagctc 


agtgcggtcc 


cagagaacac 


agggactctt 


ctcttaggac 


ctttatgtac 


9780 


agggcctcaa 


gataactgat 


gttagtcaga 


agactttcca 


ttctggccac 


agttcagctg 


9840 


aggcaatcct 


ggaattttct 


ctccgctgca 


cagttccagt 


catcccagtt 


tgtacagttc 


9900 


tggcactttt 


tgggtcaggc 


cgtgatccaa 


ggagcagaag 


ttccagctat 


ggtcagggag 


9960 


tgcctgaccg 


tcccaactca 


ctgcactcaa 


acaaaggcga 


aaccacaaga 


gtggcttttg 


10020 


ttgaaattgc 


agtgtggccc 


agaggggctg. 


caccagtact 


ggattgacca 


cgaggcaaca 


10080 


ttaatcctca 


gcaagtgcaa 


tttgcagcca 


ttaaattgaa 


ctaactgata 


ctacaatgca 


10140 


atcagtatca 


acaagtggtt 


tggcttggaa 


gatggagtct 


aggggctcta 


caggagtagc 


10200 


tactctctaa 


tggagttgca 


ttttgaagca 


ggacactgtg 


aaaagctggc 


ctcctaaaga 


10260 


ggctgctaaa 


cattagggtc 


aattttccag 


tgcactttct 


gaagtgtctg 


cagttcccca 


10320 


tgcaaagctg 


cccaaacata 


gcacttccaa 


ttgaatacaa 


ttatatgcag 


gcgtactgct 


10380 


tcttgccagc 


actgtccttc 


tcaaatgaac 


tcaacaaaca 


atttcaaagt 


ctagtagaaa 


10440 


gtaacaagct 


ttgaatgtca 


ttaaaaagta 


tatctgcttt 


cagtagttca 


gcttatttat 


10500 


gcccactaga 


aacatcttgt 


acaagctgaa 


cactggggct 


ccagattagt 


ggtaaaacct 


10560 


actttataca 


atcatagaat 


catagaatgg 


cctgggttgg 


aagggacccc 


aaggatcatg 


10620 


aagatccaac 


acccccgcca 


caggcagggc 


caccaacctc 


cagatctggt 


actagaccag 


10680 
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gcagcccagg 


gctccatcca 


acctggccat 


gaacacctcc 


agggatggag 


catccacaac 


10740 


ctctctgggc 


agcctgtgcc 


agcacctcac 


caccctctct 


gtgaagaact 


tttccctgac 


10800 


atccaatcta 


agccttccct 


ccttgaggtt 


agatccactc 


ccccttgtgc 


tatcactgtc 


10860 


tactcttgta 


aaaagttgat 


tctcctcctt 


tttggaaggt 


tgcaatgagg 


tctccttgca 


10920 


gccttcttct 


cttctgcagg 


atgaacaagc 


ccagctccct 


cagcctgtct 


ttataggaga 


10980 


ggtgctccag 


ccctctgatc 


atctttgtgg 


ccctcctctg 


gacccgctcc 


aagagctcca 


11040 


catctttcct 


gtactggggg 


ccccaggcct 


gaatgcagta 


ctccagatgg 


ggcctcaaaa 


11100 


gagcagagta 


aagagggaca 


atcaccttcc 


tcaccctgct 


ggccagccct 


cttctgatgg 


11160 


agccctggat 


acaactggct 


ttctgagctg 


caacttctcc 


ttatcagttc 


cactattaaa 


11220 


acaggaacaa 


tacaacaggt 


gctgatggcc 


agtgcagagt 


ttttcacact 


tcttcatttc 


11280 


ggtagatctt 


agatgaggaa 


cgttgaagtt 


gtgcttctgc 


gtgtgcttct 


tcctcctcaa 


11340 


atactcctgc 


ctgatacctc 


accccacctg 


ccactgaatg 


gctccatggc 


cccctgcagc 


11400 


cagggccctg 


atgaacccgg 


cactgcttca 


gatgctgttt 


aatagcacag 


tatgaccaag 


11460 


ttgcacctat 


gaatacacaa 


acaatgtgtt 


gcatccttca 


gcacttgaga 


agaagagcca 


11520 


aatttgcatt 


gtcaggaaat 


ggtttagtaa 


ttctgccaat 


taaaacttgt 


ttatctacca 


11580 


tggctgtttt 


tatggctgtt 


agtagtggta 


cactgatgat 


gaacaatggc 


tatgcagtaa 


11640 


aatcaagact 


gtagatattg 


caacagacta 


taaaattcct 


ctgtggctta 


gccaatgtgg 


11700 


tacttcccac 


attgtataag 


aaatttggca 


agtttagagc 


aatgtttgaa 


gtgttgggaa 


11760 


atttctgtat 


actcaagagg 


gcgtttttga 


caactgtaga 


acagaggaat 


caaaaggggg 


11820 


tgggaggaag 


ttaaaagaag 


aggcaggtgc 


aagagagctt 


gcagtcccgc 


tgtgtgtacg 


11880 


acactggcaa 


catgaggtct 


ttgctaatct 


tggtgctttg 


cttcctgccc 


ctggctgcct 


11940 


tagggtgcga 


tctgcctcag 


acccacagcc 


tgggcagcag 


gaggaccctg 


atgctgctgg 


12000 


ctcagatgag 


gagaatcagc 


ctgtttagct 


gcctgaagga 


taggcacgat 


tttggctttc 


12060 


ctcaagagga 


gtttggcaac 


cagtttcaga 


aggctgagac 


catccctgtg 


ctgcacgaga 


12120 


tgatccagca 


gatctttaac 


ctgtttagca 


ccaaggatag 


cagcgctgct 


tgggatgaga 


12180 


ccctgctgga 


taagttttac 


accgagctgt 


accagcagct 


gaacgatctg 


gaggcttgcg 


12240 


tgatccaggg 


cgtgggcgtg 


accgagaccc 


ctctgatgaa 


ggaggatagc 


atcctggctg 


12300 


tgaggaagta 


ctttcagagg 


atcaccctgt 


acctgaagga 


gaagaagtac 


agcccctgcg 


12360 


cttgggaagt 


cgtgagggct 


gagatcatga 


ggagctttag 


cctgagcacc 


aacctgcaag 


12420 
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agagcttgag 


gtctaaggag 


taaaaagtct 


agagtcgggg 


cggcgcgtgg 


taggtggcgg 


12480 


ggggttccca 


ggagagcccc 


cagcgcggac 


ggcagcgccg 


tcactcaccg 


ctccgtctcc 


12540 


ctccgcccag 


ggtcgcctgg 


cgcaaccgct 


gcaagggcac 


cgacgtccag 


gcgtggatca 


12600 


gaggctgccg 


gctgtgagga 


gctgccgcgc 


ccggcccgcc 


cgctgcacag 


ccggccgctt 


12660 


tgcgagcgcg 


acgctacccg 


cttggcagtt 


ttaaacgcat 


ccctcattaa 


aacgactata 


12720 


cgcaaacgcc 


ttcccgtcgg 


tccgcgtctc 


tttccgccgc 


cagggcgaca 


ctcgcgggga 


12780 


gggcgggaag 


ggggccgggc 


gggagcccgc 


ggccaaccgt 


cgccccgtga 


cggcaccgcc 


12840 


ccgcccccgt 


gacgcggtgc 


gggcgccggg 


gccgtggggc 


tgagcgctgc 


ggcggggccg 


12900 


ggccgggccg 


gggcgggagc 


tgagcgcggc 


gcggctgcgg 


gcggcgcccc 


ctccggtgca 


12960 


atatgttcaa 


gagaatggct 


gagttcgggc 


ctgactccgg 


gggcagggtg 


aaggtgcggc 


13020 


gcgggcggag 


ggacggggcg 


ggcgcggggc 


cgcccggcgg 


gtgccggggc 


ctctgccggc 


13080 


ccgcccggct 


cgggctgctg 


cggcgcttac 


gggcgcgctt 


ctcgccgctg 


ccgcttctct 


13140 


tctctcccgc 


gcaagggcgt 


caccatcgtg 


aagccggtag 


tgtacgggaa 


cgtggcgcgg 


13200 


tacttcggga 


agaagaggga 


ggaggacggg 


cacacgcatc 


agtggacggt 


ttacgtgaag 


13260 


ccctacagga 


acgaggtagg 


gcccgagcgc 


gtcggccgcc 


gttctcggag 


cgccggagcc 


13320 


gtcagcgccg 


cgcctgggtg 


cgctgtggga 


cacagcgagc 


ttctctcgta 


ggacatgtcc 


13380 


gcctacgtga 


aaaaaatcca 


gttcaagctg 


cacgagagct 


acgggaatcc 


tctccgaggt 


13440 


gggtgttgcg 


tcggggggtt 


tgctccgctc 


ggtcccgctg 


aggctcgtcg 


ccctcatctt 


13500 


tctttcgtgc 


c.g.cagtcgt.t^accaaaccgG 


cgtacgagat 


caecgaaacg 


ggctggggcg 


13560 


aatttgaaat 


catcatcaag 


atatttttca 


ttgatccaaa 


cgagcgaccc 


gtaagtacgc 


13620 


tcagcttctc 


gtagtgcttc 


ccccgtcctg 


gcggcccggg 


gctgggctgc 


tcgctgctgc 


13680 


cggtcacagt 


cccgccagcc 


gcggagctga 


ctgagctccc 


tttcccggga 


cgtgtgctct 


13740 


gtgttcggtc 


agcgaggcta 


tcgggagggc 


tttggctgca 


tttggcttct 


ctggcgctta 


13800 


gcgcaggagc 


acgttgtgct 


acgcctgaac 


tacagctgtg 


agaaggccgt 


ggaaaccgct 


13860 


ctcaaactga 


tttattggcg 


aaatggctct 


aaactaaatc 


gtctcctctc 


tttcrciaaatcr 


13920 


ctttagagaa 


ggtctctgtg 


gtagttctta 


tgcatctatc 


ctaaagcact 


tggccagaca 


13980 


atttaaagac 


atcaagcagc 


atttatagca 


ggcacgttta 


ataacgaata 


ctgaatttaa 


14040 


gtaactctgc 


tcacgttgta 


tgacgtttat 


tttcgtattc 


ctgaaagcca 


ttaaaatcct 


14100 


gtgcagttgt 


ttagtaagaa 


cagctgccac 


tgttttgtat 


ctaggagata 


actggtgttt 


14160 
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ccctacagtt 


ctcaagctga 


taaaactctg 


tctttgtatc 


taggtaaccc 


tgtatcactt 


14220 


gctgaagctt 


tttcagtctg 


acaccaatgc 


aatcctggga 


aagaaaactg 


tagtttctga 


14280 


attctatgat 


gaaatggtat 


gaaaatttta 


atgtcaaccg 


agcctgactt 


tatttaaaaa 


14340 


aaattattga 


tggtgctgtg 


tattttggtc 


cttccttaga 


tatttcaaga 


tcctactgcc 


14400 


atgatgcagc 


aactgctaac 


gacgtcccgt 


cagctgacac 


ttggtgctta 


caagcatgaa 


14460 


acagagtgta 


agtgcaaaat 


gaggatacct 


tcgccgaccg 


tcattcacta 


ctaatgtttt 


14520 


ctgtgggatg 


tgatcgtaca 


gtgagtttgg 


ctgtgtgaaa 


tttgaatagc 


ttggtattgg 


14580 


cagtgatgac 


gtgatcgatg 


ccttgcttat 


catgtttgaa 


atgaagtaga 


ataaatgcag 


14640 


cctgctttat 


ttgagatagt 


ttggttcatt 


ttatggaatg 


caagcaaaga 


ttatacttcc 


14700 


tcactgaatt 


gcactgtcca 


aaggtgtgaa 


atgtgtgggg 


atctggagga 


ccgtgaccga 


14760 


gggacattgg 


atcgctatct 


cccatttctt 


ttgctgttac 


cagttcagat 


tttcttttca 


14820 


cctagtcttt 


aattcccagg 


gttttgtttt 


ttccttggtc 


atagtttttg 


tttttcactc 


14880 


tggcaaatga 


tgttgtgaat 


tacactgctt 


cagccacaaa 


actgatggac 


tgaatgaggt 


14940 


catcaaacaa 


acttttcttc 


ttccgtattt 


cctttttttt 


cccccactta 


tcatttttac 


15000 


tgctgttgtt 


gagtctgtaa 


ggctaaaagt 


aactgttttg 


tgctttttca 


ggacgtgtgc 


15060 


tttccaaatt 


actgccacat 


atataaagaa 


aggttggaat 


tttaaagata 


attcatgttt 


15120 


cttcttcttt 


tttgccacca 


cagttgcaga 


tcttgaagta 


aaaaccaggg 


aaaagctgga 


15180 


agctgccaaa 


aagaaaacca 


gttttgaaat 


tgctgagctt 


aaagaaaggt 


taaaagcaag 


15240 


tcgtgaaacc 


atcaactgct 


taaagagtga 


aatcagaaaa 


ctcgaagagg 


atgatcagtc 


15300 


taaagatatg 


tgatgagtgt 


tgacttggca 


gggagcctat 


aatgagaatg 


aaaggacttc 


15360 


agtcgtggag 


ttgtatgcgt 


tctctccaat 


tctgtaacgg 


agactgtatg 


aatttcattt 


15420 


gcaaatcact 


gcagtgtgtg 


acaactgact 


ttttataaat 


ggcagaaaac 


aagaatgaat 


15480 


gtatcctcat 


tttatagtta 


aaatctatgg 


gtatgtactg 


gtttatttca 


aggagaatgg 


15540 


atcgtagaga 


cttggaggcc 


agattgctgc 


ttgtattgac 


tgcatttgag 


tggtgtagga 


15600 


acattttgtc 


tatggtcccg 


tgttagttta 


cagaatgcca 


ctgttcactg 


ttttgttttg 


15660 


tattttactt 


tttctactgc 


aacgtcaagg 


ttttaaaagt 


tgaaaataaa 


acatgcaggt 


15720 


tttttttaaa 


tatttttttg 


tctctatcca 


gtttgggctt 


caagtattat 


tgttaacagc 


15780 


aagtcctgat 


ttaagtcaga 


ggctgaagtg 


taatggtatt 


caagatgctt 


aagtctgttg 


15840 


tcagcaaaac 


aaaagagaaa 


acttcataaa 


atcaggaagt 


tggcatttct 


aataacttct 


15900 
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ttatcaacag 


ataagagttt 


ctagccctgc 


atctactttc 


acttatgtag 


ttgatgcctt 


15960 


tatattttgt 


gtgtttggat 


gcaggaagtg 


attcctactc 


tgttatgtag 


atattctatt 


16020 


taacacttgt 


actctgctgt 


gcttagcctt 


tccccatgaa 


aattcagcgg 


ctgtaaatcc 


16080 


ccctcttctt 


ttgtagcctc 


atacagatgg 


cagaccctca 


ggcttataaa 


ggcttgggca 


16140 


tcttctttac 


tgctttgaga 


ttctgtgttg 


cagtaacctc 


tgccagagag 


gagaaaagcc 


'16200 


ccacaaacct 


catccccttc 


ttctatagca 


atcagtatta 


ctaatgcttt 


gagaacagag 


16260 


cactggtttg 


aaacgtttga 


taattagcat 


ttaacatggc 


ttggtaaaga 


tgcagaactg 


16320 


aaacagctgt 


gacagtatga 


actcagtatg 


gagacttcat 


taagacaaac 


agctgttaaa 


16380 


atcaggcatg 


tttcattgag 


gaggacgggg 


caacttgcac 


cagtggtgcc 


cacacaaatc 


16440 


cttcctggcg 


ctgcagacca 


atttttctgg 


cattctgact 


gccgttgctg 


ctggtcacag 


16500 


agagcaacta 


tttttatcag 


ccacaggcaa 


tttgcttgta 


gtattttcca 


agtgttgtag 


16560 


gtaagtataa 


atgcatcggc 


tccagagcac 


tttgagtata 


cttattaaaa 


acataaatga 


16620 


aagacaaatt 


agctttgctt 


gggtgcacag 


aacattttta 


gttccagcct 


gctttttggt 


16680 


agaagccctc 


ttctgaggct 


agaactgact 


ttgacaagta 


gagaaactgg 


caacggagct 


16740 


attgctatcg 


aaggatcctt 


gttaacaaag 


ttaatcgtct 


tttaaggttt 


ggtttattca 


16800 


ttaaatttgc 


ttttaagctg 


tagctgaaaa 


agaacgtgct 


gtcttccatg 


caccaggtgg 


16860 


cagctctgtg 


caaagtgctc 


tctggtctca 


ccagcctttt 


aattgccggg 


attctggcac 


16920 


gtctgagagg 


gctcagactg 


gcttcgtttg 


tttgaacagc 


gtgtactgct 


ttctgtagac 


16980 


atggccggtt 


tctetcctgc 


agcttatgaa 


ac-tgt-tcaea 


ctgaacacac 


tggaacaggt 


17040 


tgcccaagga 


ggccgtggat 


gccccatccc 


tggaggcatt 


caaggccagg 


ctggatgtgg 


17100 


ctctgggcag 


cctggtctgg 


tggttggcga 


tcctgcacat 


agcagcgggg 


ttgaaactcg 


17160 


atgatcactg 


tggtcctttt 


caacccaggc 


tattctatga 


ttctatgatt 


caacagcaaa 


17220 


tcatatgtac 


tgagagagga 


aacaaacaca 


agtgctactg 


tttgcaagtt 


ttgttcattt 


17280 


ggtaaaagag 


tcaggtttta 


aaattcaaaa 


tctgtctggt 


tttggtgttt 


tttttttttt 


17340 


atttattatt 


tctttggggt 


tctttttgat 


gctttatctt 


tctctgccag 


gactgtgtga 


17400 


caatgggaac 


gaaaaagaac 


atgccaggca 


ctgtcctgga 


ttgcacacgc 


tggttgcact 


17460 


cagtagcagg 


ctcagaactg 


ccagtctttc 


cacagtatta 


ctttctaaac 


ctaattttaa 


17520 


tagcgttagt 


agacttccat 


cactgggcag 


tgcttagtga 


atgctctgtg 


tgaacgtttt 


17580 


acttataagc 


atgttggaag 


ttttgatgtt 


cctggatgca 


gtagggaagg 


acagattagc 


17640 
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tatgtgaaaa 


gtagattctg 


agtatcgggg 


ttacaaaaag tatagaaacg atgagaaatt 


17700 


cttgttgtaa 


ctaattggaa 


tttctttaag 


cgttcactta tgctacattc atagtatttc 


17760 


catttaaaag 


taggaaaagg 


taaaacgtga 


aatcgtgtga ttttcggatg gaacaccgcc 


17820 


ttcctatgca 


cctgaccaac 


ttccagagga 


aaagcctatt gaaagccgag attaagccac 


17880 


caaaagaact 


catttgcatt 


ggaatatgta 


gtatttgccc tcttcctccc gggtaattac 


17940 


tatactttat 


agggtgctta 


tatgttaaat 


gagtggctgg cactttttat tctcacagct 


18000 


gtggggaatt 


ctgtcctcta 


ggacagaaac 


aattttaatc tgttccactg gtgactgctt 


18060 


tgtcagcact 


tccacctgaa 


gagatcaata 


cactcttcaa tgtctagttc tgcaacactt 


18120 


ggcaaacctc 


acatcttatt 


tcatactctc 


ttcatgccta tgcttattaa agcaataatc 


18180 


tgggtaattt 


ttgttttaat 


cactgtcctg 


accccagtga tgaccgtgtc ccacctaaag 


18240 


ctcaattcag 


gtcctgaatc 


tcttcaactc 


tctatagcta acatgaagaa tcttcaaaag 


18300 


ttaggtctga 


gggacttaag 


gctaactgta 


gatgttgttg cctggtttct gtgctgaagg 


18360 


ccgtgtagta 


gttagagcat 


tcaacctcta 


g 


18391 



<210> 11 

<211> 586 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MDOT artificial promoter 

<400> 11 



gtaccgggc.c 


ccccctcgag gtgaatatcc 


aagaatgcag aactgcatgg aaagcagagc 


60 


tgcaggcacg 


atggtgctga 


gccttagctg 


cttcctgctg ggagatgtgg atgcagagac 


120 


gaatgaagga 


cctgtccctt 


actcccctca 


gcattctgtg ctatttaggg ttctaccaga 


180 


gtccttaaga 


ggtttttttt 


ttttttggtc 


caaaagtctg tttgtttggt tttgaccact 


240 


gagagcatgt 


gacacttgtc 


tcaagctatt 


aaccaagtgt ccagccaaaa tcgatgtcac 


300 


aacttgggaa 


ttttccattt 


gaagcccctt 


gcaaaaacaa agagcacctt gcctgctcca 


360 


gctcctggct 


gtgaagggtt 


ttggtgccaa 


agagtgaaag gcttcctaaa aatgggctga 


420 


gccggggaag 


gggggcaact 


tgggggctat 


tgagaaacaa ggaaggacaa acagcgttag 


480 


gtcattgctt 


ctgcaaacac 


agccagggct 


gctcctctat aaaaggggaa gaaagaggct 


540 


ccgcagccat 


cacagaccca 


gaggggacgg 


tctgtgaatc aagctt 


586 



<210> 12 
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<211> 11 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> SV40 terminator 
<400> 12 

Cys Gly Gly Pro Lys Lys Lys Arg Lys Val Gly 
15 10 



<210> 13 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer SaltoNotI 

<400> 13 
tcgagcggcc gc 



<210> 14 
<211> 83 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 14 

atggctttga cctttgcctt actggtggct ctcctggtgc tgagctgcaa gagcagctgc 60 
tctgtgggct gcgatctgcc tea 83 

<210> 15 
<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 15 

gacccacagc ctgggcagca ggaggaccct gatgetgetg gctcagatga ggagaatcag 60 
cctgtttagc tgectgaagg ataggcacga ttttggcttt 10 

<210> 16 
<211> 62 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



<400> 16 
cti 

tg 



ctcaagagga gtttggcaac cagtttcaga aggctgagac catccctgtg ctgcacgaga 60 



<210> 17 
<211> 94 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223>primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

tcSagcagat ctttaacctg tttagcacca aggatagcag cgctgcttgg gatgagaccc 60 
tgctggataa gttttacacc gagctgtacc agca 

<210> 18 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

ctgaacgatc tggaggcttg cgtgatccag- ggcgtgggcg tgaccgagac ccctctgatg 60 
aaggaggata gcatcct 

<210> 19 
<211> 82 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



gctgtgagga agtactttca gaggatcacc ctgtacctga aggagaagaa gtacagccct 60 
tgcgcttggg aagtcgtgag gg 

<210> 20 
<211> 65 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 20 

ctgagatcat gaggagcttt agcctgagca ccaacctgca agagagcttg aggtctaagg 60 
agtaa 65 

<210> 21 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 21 

cccaagcttt caccatggct ttgacctttg cctt 34 

<210> 22 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



<400> 22 

atctgcctca gacccacag 19 

<21G->-~2~3 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 23 

gattttggct ttcctcaaga ggagtt 26 

<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



-39- 



WO 03/024199 



<400> 24 

gcacgagatg atccagcaga t 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 25 

atcgttcagc tgctggtaca 

<210> 26 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 26 

cctcacagcc aggatgctat 

<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 27 

atgatctcag ccctcacgac 

<210> 28 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 28 

ctgtgggtct gaggcagat 

<210> 29 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 29 

aactcctctt gaggaaagcc aaaatc 26 

<210> 30 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 30 

atctgctgga tcatctcgtg c 21 

<210> 31 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 31 

tgctctagac tttttactcc ttagacctca agctct 36 



<210> 32 
<211> 25 
<212>- DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of the MOOT promoter 
<400> 32 

tcactcgagg tgaatatcca agaat 25 

<210> 33 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of the MDOT promoter 
<400> 33 

gagatcgatt ttggctggac acttg 25 

<210> 34 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> primer used in the synthesis of the MDOT promoter 
<400> 34 

cacatcgatg tcacaacttg ggaat 

<210> 35 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of the MDOT promoter 
<400> 35 

tctaagcttc gtcacagacc gtccc 



PCT/US02/30156 



25 



25 
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